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(57) Abstract 

The invention relates to constructs comprising: a) a targeting sequence; b) a regulatory sequence; c) an exon; and d) an unpaired 
splice-donor site. The invention further relates to a method of producing protein in vitro or in vivo comprising the homologous recombination 
of a construct as described above within a cell. The homologously recombinant cell is then maintained under conditions which will permit 
transcription and translation, resulting in protein expression. The present invention further relates to homologously recombinant cells 
including primary, secondary, or immortalized vertebrate cells, methods of making the cells, methods of homologous recombination to 
produce fusion genes, methods of altering gene expression in the cells, and methods of making a protein in a cell employing the constructs 
of the invention. 
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DNA construct for effecting homologous recombination and uses thereof 



Background of t-.h» Tnv^< T 

Current approaches to treating disease by administer- 
ing therapeutic proteins include is vitro production of 
therapeutic proteins for conventional pharmaceutical 
delivery (e.g. intravenous, subcutaneous, or intramuscular 
injection) and, more recently, gene therapy. 

Proteins of therapeutic interest are generally pro- 
duced by introducing exogenous DNA encoding the protein of 
therapeutic interest into appropriate cells. For example, 
exogenous DNA encoding a desired therapeutic protein is 
introduced into cells, such as immortalized cells in a 
vector, such as a plasmid, from which the encoded protein 
is expressed. Further, it has been suggested that endoge- 
nous cellular genes and their expression may be modified 
by gene targeting. See for example, U.S. Patent No. 
5,272,071, WO 91/06666, WO 91/06667 and WO 90/11354. 
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Presently-available approaches to nOTO 

use of infectious vectors such . ^ 

which include the genet" • f rBtXWl » 1 Vect °~' 

tne y en etic material to be exm-ess^ o„~u 

=™ have limitations, such „ the ~. f 
S generatrng replication-competent virus during vector 
production recombination between the therapeutic virus 
and endogenous ratroviral genomes, potentially generating 

10 ZH', lnCreMed Ttatf -" Md =y"'o-icity.. indapan- 
tta ri k T Cl0n iDt ° 18196 ° £ LraasL" 

cionL insertion.1 avast.- limited 

clonmg capacrty i„ th , retrovirus (which restricts there- 
to appUcability, and short-Uved in ^ assail 
of the product of interest. A better approach to provid- 

Umit T ' PaXtiCUlarly ~ «"«* -ids i 

lrmrtatrona and risks essociatad with presently available 
methods, would be valuable. liable 

Summary ^> »).. IffiSBtlgj 

20 both inVeation rel »« « improved method, for 

for the i V teB Pr0dUC " OT ° £ th„apautic protein, and 

for the production and dellverv r.t .k. , 

gene ther.™ y„ °' Ilveiy o£ therapeutic proteins by 

a^IJrr PrMen,: ° ethod ' session of a 

nor' „ in a MU * endoge- 

« 31 9eM> " ° ltered * tte Production, by 

Presa i !:ced re rr binati0n ^ "» -""»« " « 

tor^ . ° £ WMl:h i ° Clud ' s " - Ea- 

tery sequence, an axon and a splice donor site. These 
components are introduced im-„ .v. .. 

DHA i„ «,,.>. . roaueM the chromosomal (genomic) 

DNA in auch a manner that this, in effect, results in 

re" let * ' t ~* 1 - »** «»ich the 
present r""' ~ — d °"« •**• 

s : i s a:~ - «— « - 

5 *" a result of introduction of these 
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components into the chromosomal DNA, the expression of the 
desired endogenous gene is altered. 

Altered gene expression, as used herein, encompasses 
activating (or causing to be expressed) a gene which is 
normally silent (unexpressed) in the cell as obtained, 
increasing expression of a gene which is not expressed at 
physiologically significant levels in the cell as 
obtained, changing the pattern of regulation or induction 
such that it is different than occurs in the cell as 
obtained, and reducing (including eliminating) expression 
of a gene which is expressed in the cell as obtained. 

The present invention further relates to DNA con- 
structs useful in the method of altering expression of a 
target gene. The DNA constructs comprise: (a) a targeting 
sequence; (b) a regulatory sequence; (c) an exon; and (d) 
an unpaired splice-donor site. The targeting sequence in 
the DNA construct directs the integration of elements 
(a) - (d) into a target gene in a cell such that the 
elements (b) - (d) are operatively linked to sequences of 
20 the endogenous target gene. In another embodiment, the 
DNA constructs comprise: (a) a targeting sequence, (b) a 
regulatory sequence, (c) an exon, (d) a splice-donor site, 
(e) an intron, and (f) a splice-acceptor site, wherein the 
targeting sequence directs the integration of elements 
25 (a) - (f) such that the elements of (b) - (f) are opera- 
tively linked to the endogenous gene. The targeting 
sequence is homologous to the preselected site in the 
cellular chromosomal DNA with which homologous recombina- 
tion is to occur. In the construct, the exon is generally 
30 3' of the regulatory sequence and the splice-donor site is 
3' of the exon. 

The following serves to illustrate two embodiments of 
the present invention, in which the sequences upstream of 
the human erythropoietin h(EPO) gene are altered to allow 
expression of hEPO in primary, secondary, or immortalized 
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cells wh lc h do not egress EPO in detectable cities in 

tt 12 ZZ t state as obtained - In " 

The f lr .t targeting sequence is homologous to sequL., 5- 
of the second targeting sequence, and both se^enceTare 

"tlT", »9ion. The targeting C on 

struc also contsins s regulatory region ,the ^ p L 
meter) , n exon (human grow* hormone (hGH) ) eren 1) 
unpaired splice-donor site. The product o Zlogou" 

CT" Bith this tar9 " lns f-^- 
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Figure l. 



tains^Jt 031 """' *' ME3e " nS C ° nstru « con- 
tains two targeting sequences. The first targeting se 

« £p7re™ia7° l09OUS " "™ " lthi ° * 

Ho^r ^ regi0 "' -* SeCond '"^ting science 

also^ntfT t0 iMr0n J - "» *~* consSuT 

in these two embodiments, the products of the taro.f 
mg events are chimeric transcription units SlTJ^T 

'ZT T in which the fi ~< — -"'.tHgeTi. 

positioned upstream of hEPO exone 2-5. The product of 

30 t ° h f : he re9uiatory — °* target „ 3 

alll T. t0 ° CCUr t0 produce the final, pro- 

cessed transcript. ' p 

The invention further relates to a method of pro- 
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DNA by homologous recombination to produce a homologously 
recombinant cell. The homologously recombinant cell is 
then maintained under conditions which will permit tran- 
scription, translation and secretion, resulting in produc- 
5 tion of the protein of interest. 

The present invention relates to transfected cells, 
such as transfected primary or secondary cells (i.e., non- 
immortalized cells) and transfected immortalized cells, 
useful for producing proteins, particularly therapeutic 
10 proteins, methods of making such cells, methods of using 
the cells for jji yjtro protein production, and methods of 
gene therapy. Cells of the present invention are of 
vertebrate origin, particularly of mammalian origin, and 
even more particularly of human origin. Cells produced by 
15 the method of the present invention contain DNA which 
encodes a therapeutic product, DNA which is itself a 
therapeutic product and/or DNA which causes the 
transfected cells to express a gene at a higher level or 
with a pattern of regulation or induction that is differ- 
20 ent than occurs in the corresponding nontransfected cell. 
The present invention also relates to methods by 
which cells, such as primary, secondary, and immortalized 
cells, are transfected to include exogenous genetic mate- 
rial, methods of producing clonal cell strains or heterog- 
25 enous cell strains, and methods of immunizing animals or 
producing antibodies in immunized animals, using the 
transfected primary, secondary, or immortalized cells. 

The present invention relates particularly to a 
method of gene targeting or homologous recombination in 
30 eukaryotic cells, such as cells of fungal, plant or ani- 
mal, e.g., vertebrate, particularly mammalian, and even 
more particularly, human' origin. That is, it relates to a 
method of introducing DNA into primary, secondary, or 
immortalized cells of vertebrate origin through homologous 
35 recombination, such that the DNA is introduced into genom- 
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selected with ^encTtHS^lrr h"". "* 

♦.u sice into which the dna in 

the targets » construct is „ be 

"^""^ * recombinant 

primary, secondary, or immortalised cells, referred to ss 
homologously recombinant (HR1 „„<..,, eierrea to as 

tali..rf IHR) primary, secondary or immor- 

o£ th „ * the PrMent -«*■» and to uses 

of the HR primary, secondary, or immortalised cells! 

express^rof^" 61 " " ' 1 ~ u - in 

^ressron of a gen. is altered, the gen. is activated. 

15 result fh. S obtain «l. is activated and, as a 

ment homo! Pr ° teln " ^ reMed - In «*s emhodi- 

ment, homologous recombination is used to replace, dis- 

^th'thl ' ^ "^""^ normally sssociated 

^ . rellT " CSUS " 0b " lned the insertion 

of a regulatory sequence which causes the gene to b. 

' activated JL Gaining many copies of the 

activated endogenous gene are useful for , j» 
Production and gene therapy. ^ ^ Pr ° tein 

Preset *"* aii * lific ^°n as disclosed in the 

Present invention are particularly useful for activating 
^ expression of genes which for. transcription units 
wh lch are sufficiently large that they are difficult to 
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isolate and express, or for activating genes for which the 
entire protein coding region is unavailable or has not 
been cloned. 

In a further embodiment, expression of a gene which 
5 is expressed in a cell as obtained is enhanced or caused 
to display a pattern of regulation or induction that is 
different than evident in the corresponding nontransfected 
cell, in another embodiment, expression of a gene which 
is expressed in a cell as obtained is reduced (i.e., 
10 lessened or eliminated) . The present invention also de- 
scribes a method by which homologous recombination is used 
to convert a gene into a cDNA copy, devoid of introns, for 
transfer into yeast or bacteria for in vitro protein 
production. 

15 Transfected cells of the present invention are useful 

in a number of applications in humans and animals. In one 
embodiment, the cells can be implanted into a human or an 
animal for protein delivery in the human or animal. For 
example, hGH, hEPO, human insulinotropin, and other pro- 

20 teins can be delivered systemically or locally in humans 
for therapeutic benefits, m addition, transfected non- 
human cells producing growth hormone, erythropoietin, 
insulinotropin and other proteins of non-human origin may 
be produced. 

25 Barrier devices, which contain transfected cells 

which express a therapeutic product and through which the 
therapeutic product is freely permeable, can be used to 
retain cells in a fixed position in yiya or to protect and 
isolate the cells from the host's immune system. Barrier 
devices are particularly useful and allow transfected 
immortalized cells, transfected xenogeneic cells, or 
transfected allogeneic cells to be implanted for treatment 
of human or animal conditions or for agricultural uses 
(e.g., bovine growth hormone for dairy production) . 
35 Barrier devices also allow convenient short-term" (i.e., 
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any reason. In addLI„ T V 9 "*" U " hal " d £ « 
allogeneic cells 1 i '"""acted xenogeneic and 
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Produce produced by cTeTr, raPy ' ^ "» «™ 
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- bu M ns an" ^"^^ « - — g 

transited cells cj * T^ZITZJ^^ 
gens that result •, . aej - lve r immunizing anti- 

Wral JZ re" ' s h ° St ' 1 » d 

Signed f or protection of ^TZJIT*" « * 
5 agents (i e , ° m future infectious 

- -^srs^cr Md aw 

Purposes. Rentable berri.r ?"" P * Uti = ° r 
can be used to allow Hi", » the cells 

-» to the antigen Altl'^T °' ■•»- 
wiU ultimately oe reietteT" ' ° £ «»* 

transf.cted cellsT ™ T '^cgeneic or allogeneic 

antigen, .L fZiZ T '° *"«""«• '° ">° 

cells have ^r** ^ "« — * 

producT,:::; 3 : ::coX r :r~r r ba - - 

ing a wide varietv of ,k ^""rtaUzed cells produc- 

including .buT^ S^HTT' Pr0dU= "' 
antigens, antibodies. ^ clo'tTTfa t™""' 
proteins, receotors „™,H / C1 °" ln 9 £ a=tors, transport 
Proteins till regulatory proteins, structural 

» ^ddi tionX r £ao r s ' rib °^ s - 

can be used tTp ~du« ceu" Tit ^ 1 ~* 
P oauce cells which produce non-naturally 
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occurring ribozymes, proteins, or nucleic acids which are 
useful for in vitro production of a therapeutic product or 
for gene therapy. 

Brief Description of the Drawing * 

5 Figure 1 is a schematic diagram of a strategy for 

transcriptionally activating the hEPO gene; thick lines, 
mouse metallothionein I promoter; stippled box, 5' un- 
translated region of hGH; solid box, hGH exon 1; striped 
box, 10 bp splice-donor sequence from hEPO intron 1; 
10 cross-hatched box, 5' untranslated region of hEPO; open 
numbered boxes, hEPO coding sequences; diagonally-stripped 
box, hEPO 3' untranslated sequences; HIII, Hindlll site. 

Figure 2 is a schematic diagram of a strategy for 
transcriptionally activating the hEPO gene; thick lines, 
15 mouse metallothionein I promoter; stippled box, 5' un- 
translated region of hGH; solid box, hGH exon 1; open 
numbered boxes, hEPO coding sequences; diagonally-stripped 
box, hEPO 3' untranslated sequences; HIII, Hindlll site. 
Figure 3 is a schematic representation of plasmid 
20 pXGHS, which includes the hGH gene under the control of 
the mouse metallothionein promoter. 

Figure 4 is a schematic representation of plasmid 
pE3neoEPO. The positions of the human erythropoietin gene 
and the neomycin phosphotranferase gene (neo) and 
25 ampicillin (amp) resistance genes are indicated. Arrows 
indicate the directions of transcription of the various 
genes. pmMTl denotes the mouse metallothionein promoter 
(driving hEPO expression) and pTK denotes the Herpes 
Simplex Virus thymidine kinase promoter (driving neo 
30 expression) . The dotted regions of the map mark the 
positions of sequences derived from the human hypoxan- 
thine-guanine phosphoribosyl transferase (HPRT) locus. 
The relative positions of restriction endonuclease recog- 
nition sites are indicated. 
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(HF) and targeted (Tl) human fibroblast clone HF342-15 
(see Example 7) . 

Figure 10 is a schematic representation of plasmid 
PREP018. Fragments derived from genomic hEPO sequences 
5 are indicated by filled boxes. The region between BamHI 
(3537) and Clal (7554) corresponds to sequences at posi- 
tions 1-4008 in Genbank entry HDMERPALU. The region 
between ATG (12246) and Hindlll (13426) corresponds to DNA 
sequence at positions 4009-5169 in Genbank entry 
10 HUMERPLAD. The region between Hindlll (13426) and Xhol 
(624) contains sequence corresponding to positions 7-624 
of Genbank entry HDMERPA. CMV promoter sequences are 
shown as an open box and contains sequence from nucleo- 
tides 546-2015 of Genbank sequence HS5MIEP. The 
15 dihydrofolate reductase (dhfr) transcription unit is shown 
as a stippled box with an arrow. The neo gene is shown as 
an open box with an arrow. The tk promoter driving the 
neo gene is shown as a hatched box. pBSIISK+ sequences 
including the amp gene are indicated by a thin line. 
20 Figure 11 is a schematic illustration of a construct 

of the invention for activating and amplifying an 
intronless gene, the a- interferon gene, where the con- 
struct comprises a first targeting sequence (1) , an ampli- 
f iable marker gene (AM) , a selectable marker gene (SM) , a 
25 regulatory sequence, a CAP site, a splice-donor site (SD) , 
an intron (thin lines) , a splice-acceptor site (SA) and a 
second targeting sequence (2) . The black box represents 
coding DNA and the stippled boxes represent untranslated 
sequences. 

JO Figure 12 is a schematic illustration of a construct 

of the invention for activating and amplifying an endoge- 
nous gene wherein the first exon contributes to the signal 
peptide, the human GM-CSF gene, where the construct com- 
prises a first targeting sequence (1), an amplif iable 

15 marker gene (AM) , a selectable marker gene (SM) , a regula- 
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tory sequence, a CAP site a 

coding DBA and the stiool J * * "° XeS "Present 

serenes. * " OXeS "Present untranslated 

5 Figure 13 is . schematic illustrate- „« 
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(a) a targeting sequence, (b) a regulatory sequence, (c) 
an exon, (d) a splice-donor site, (e) an intron, and (f) a 
splice-acceptor site, wherein the targeting sequence 
directs the integration of elements (a) - (f ) such that 
5 the elements of (b) - (f ) are operatively linked to the 
first exon of the endogenous gene. The targeting sequen- 
ces used are selected with reference to the site into 
which the DNA is to be inserted. In both embodiments the 
targeting event is used to create a new transcription 
10 unit, which is a fusion product of sequences introduced by 
the targeting DNA constructs and the endogenous cellular 
gene. As discussed herein, for example, the formation of 
the new transcription unit allows transcriptionally silent 
genes (genes not expressed in a cell prior to transfec- 
15 tion) to be activated in host cells by introducing into 
the host cell's genome DNA constructs of the present 
invention. As also discussed herein, the expression of an 
endogenous gene which is expressed in a cell as obtained 
can be altered in that it is increased, reduced, including 
eliminated, or the pattern of regulation or induction may 
be changed through use of the method and DNA constructs of 
the present invention. 

The present invention as set forth above, relates to 
a method of gene or DNA targeting in cells of eukaryotic 
25 origin, such as of fungal, plant or animal, such as, 

vertebrate, particularly mammalian, and even more particu- 
larly human origin. That is, it relates to a method of 
introducing DNA into a cell, such as primary, secondary, 
or immortalized cells of vertebrate origin, through homol- 
30 ogous recombination or targeting of the DNA, which is 

introduced into genomic DNA of the cells at a preselected 
site. It is particularly related to homologous recombina- 
tion in which the transcription and/or translation prod- 
ucts of endogenous genes are modified through the use of 
DNA constructs comprising a targeting sequence, a regula- 
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The following is a description of the DNA constructs 
of the present invention, methods in which they are used 
to produce transfected cells, transfected cells and uses 
of these cells. 

5 The DNA Const-n^ 

The DNA construct of the present invention includes 
at least the following components: a targeting sequence; 
a regulatory sequence; an exon and an unpaired splice - 
donor site. In the construct, the exon is 3' of the 
10 regulatory sequence and the unpaired splice-donor site is 
3' of the exon. In addition, there can be multiple exons 
and/or introns preceding (5' to) the exon flanked by the 
unpaired splice-donor site. As described herein, there 
frequently are additional construct components, such as a 
15 selectable markers or amplifiable markers. 

The DNA in the construct may be referred to as exoge- 
nous. The term "exogenous" is defined herein as DNA which 
is introduced into a cell by the method of the present 
invention, such as with the DNA constructs defined herein. 
20 Exogenous DNA can possess sequences identical to or dif- 
ferent from the endogenous DNA present in the cell prior 
to trans fection. 



25 



30 



The Targeting Sequence or B» m i»n roa 

The targeting sequence or sequences are DNA sequences 
which permit legitimate homologous recombination into the 
genome of the selected cell containing the gene of inter- 
est. Targeting sequences are, generally, DNA sequences 
which are homologous to (i.e., identical or sufficiently 
similar to cellular DNA such that the targeting sequence 
and cellular DNA can undergo homologous recombination) DNA 
sequences normally present in the genome of the cells as 
obtained (e.g., coding or noncoding DNA, lying upstream of 
the transcriptional start site, within, or downstream of 
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the transcriptional stop site of a «n« * 

" rCUlar PlMnid ° r DIK £ «*"«" Preferably 

a « or interest (such as, the sequences of an 
exon and/or intron,. irately adjl00nt \T 
xntereat «.... v,ith no additional nucleotides between L 

■ — ! targeted gene presently known or 

"rally unch.raet.rUed but can be capped using restric^ 

ins.^f ^ ' 9ene ^"^9 «n be uaed to 

g"e aa a a r Tr 0ry £ ™ » *"«.nt 

Mm,!.- sources, or synthesized as a novel 

or addition ceiiuxar gene. Alternatively 

abm T ' 8eqUenCeS WhiCh aff6Ct the -trueture or 
stabxlxty of the RNA or protein produced can be replaced 

IZ^' ^f d ' ° r ° therWi8e m ° di£ied -rget^r or' 
exa^e k* A stability elements, splice sites, and/or 
leader sequences of RNA mo iecules can be modified to 
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improve or alter the function, stability, and/or translat- 
ability of an RNA molecule. Protein sequences may also be 
altered, such as signal sequences, propeptide sequences, 
active sites, and/or structural sequences for enhancing or 
5 modifying transport, secretion, or functional properties 
of a protein. According to this method, introduction of 
the exogenous DNA results in the alteration of the normal 
expression properties of a gene and/or the structural 
properties of a protein or RNA. 

10 The Reoulatorv Sequence 

The regulatory sequence of the DNA construct can be 
comprised of one or more promoters (such as a constitutive 
or inducible promoter) , enhancers, scaffold-attachment 
regions or matrix attachment sites, negative regulatory 

15 elements, transcription factor binding sites, or combina- 
tions of said sequences. 

The regulatory sequence can contain an inducible 
promoter, with the result that cells as produced or as 
introduced into an individual do not express the product 

20 but can be induced to do so (i.e., expression is induced 
after the transfected cells are produced but before im- 
plantation or after implantation) . DNA encoding the 
desired product can, of course, be introduced into cells 
in such a manner that it is expressed upon introduction 

25 (e.g., under a constitutive promoter). The regulatory 
sequence can be isolated from cellular or viral genomes, 
(such regulatory sequences include those that regulate the 
expression of SV40 early or late genes, adenovirus major 
late genes, the mouse metallothionein- I gene, the elonga- 

30 tion factor- lor gene, cytomegalovirus genes, collagen 

genes, actin genes, immunoglobulin genes or the HMG-CoA 
reductase gene) . The regulatory sequence preferably con- 
tains transcription factor binding sites, such as a TATA 
Box, CCAAT Box, API, Spl or NF-xB binding sites. 
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represented as: (A/C) AG GURAGU (where R denotes a purine 
nucleotide) with the GU in the fourth and fifth positions, 
being required (Jackson, I.J., Nucleic Acids Research 19: 
3715-3798 (1991)). The first three bases of the splice- 
5 donor consensus site are the last three bases of the exon. 
Splice-donor sites are functionally defined by their 
ability to effect the appropriate reaction within the mRNA 
splicing pathway. 

An unpaired splice-donor site is defined herein as a 
10 splice-donor site which is present in a targeting con- 
struct and is not accompanied in the construct by a 
splice -acceptor site positioned 3' to the unpaired splice- 
donor site. The unpaired splice-donor site results in 
splicing to an endogenous splice -acceptor site. 
15 A splice -acceptor site in a sequence which, like a 

splice-donor site, directs the splicing of one exon to 
another exon. Acting in conjunction with a splice-donor 
site, the splicing apparatus uses a splice -acceptor site 
to effect the removal of an intron. Splice-acceptor sites 
10 have a characteristic sequence represented as: YYYYYYYYYY- 
NYAG, where Y denotes any pyrimidine and N denotes any 
nucleotide (Jackson, I.J., Nucleic Acids Research 19: 3715- 
3798 (1991)). 

An intron is defined as a sequence of one or more 
15 nucleotides lying between two exons and which is removed, 
by splicing, from a precursor RNA molecule in the forma- 
tion of an mRNA molecule. 

The regulatory sequence is, for example, operatively 
linked to an ATG start codon, which initiates translation. 
0 Optionally, a CAP site (a specific mRNA initiation site 
which is associated with and utilized by the regulatory 
region) is operatively linked to the regulatory sequence 
and the ATG start codon. Alternatively, the CAP site 
associated with and utilized by the regulatory sequence is 
5 not included in the targeting construct, and the trans- 
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splice-acceptor site is included in the targeting con- 
struct, the splicing event removes the intron introduced 
by the targeting construct. 

The encoding DNA (e.g., in exon 1 of the targeting 
construct) employed can optionally encode one or more 
amino acids, and/or a portion of an amino acid, which are 
the same as those of the endogenous protein. The enco- 
ding DNA sequence employed herein can, for example, corre- 
spond to the first exon of the gene of interest. The 
encoding DNA can alternatively encode one or more amino 
acids or a portion of an amino acid different from the 
first exon of the protein of interest. Such an embodiment 
is of particular interest where the amino acids of the 
first exon of the protein of interest are not critical to 
15 the activity or activities of the protein. For example, 
when fusions to the endogenous hEPO gene are constructed, 
sequences encoding the first exon of hGH can be employed. 
In this example, fusion of hGH exon 1 to hEPO exon 2 
results in the formation of a hybrid signal peptide which 
20 is functional, in related constructs, any exon of human 
or non-human origin in which the encoded amino acids do 
not prevent the function of the hybrid signal peptide can 
be used. In a related embodiment, this technique can also 
be employed to correct a mutation found in a target gene. 

Where the desired product is a fusion protein of the 
endogenous protein and encoding sequences in the targeting 
construct, the exogenous encoding DNA incorporated into 
the cells by the present method includes DNA which encodes 
one or more exons or a sequence of cDNA corresponding to a 
translation or transcription product which is to be fused 
to the product of the endogenous targeted gene. In this 
embodiment, targeting is used to prepare chimeric or 
multifunctional proteins which combine structural, enzy- 
matic, or ligand or receptor binding properties from two 
35 or more proteins into one polypeptide. For example, the 



25 



30 



WO 9501560 



PCT/US95/06045 



-22- 



20 



25 



30 



exogenous DNA can encode an anchor to the membrane for the 
targeted protein or a signal peptide to provide or improve 
cellular secretion, leader sequences, enzymatic regions, 
transmembrane domain regions, co-factor binding regions or 

5 other functional regions Btamni.. . 

-.- te ^°ns. Examples of proteins which are 

not normally secreted, but which could be fused to a 
sxgnal protein to provide secretion include dopa-decarbox- 
ylase, transcriptional regulatory proteins, «-galactosi- 
dase and tyrosine hydroxylase. 
0 Where the first exon of the targeted gene corresponds 

to a non-codxng region (for example, the first exon of the 
fomcle-stxmulating hormone beta (FSH/?) gene, an exoge- 
nous ATG is not required and, preferably, is omitted. 

in H t ° f C ° nStrUCt ^ be ° btained purees 
xn whxch xt occurs in nature or can be produced, using 

genetic engineering techniques or synthetic processes 

Tjie Target Gepf , ^ ^m ltA™ ggg^ 

as p r^JT C ° nS r Ct ' Wh6n transfected -to cells, such 
as prxmary, secondary or immortalized cells, can control 
the expression of a desired product for example, the 
active or, functional portion of the protein or Rna. The 
product can be for- pv aB ni„ , 

an oe, for example, a hormone, a cytokine, an 

antigen, an antibody, an enzyme, a clotting factor, a 
transport protein, a receptor, a regulatory protein, a 
structu^ protein, a transcription factor, an anti^ense 
Z ' Additionally, the product can be a 

protein or a nucleic acid which does not occur in nature 
U.e., a fusion protein or nucleic acid). 

The method as described herein can produce one or 
more therapeutic products, such as erythropoietin, calci- 
tonin, growth hormone, insulin, insulinotropin, insulin- 
like growth factors, parathyroid hormone, interferon B 

TarlTlT ^ grOWth faCt ° rS ' FSH ^ to« 
necrosis factor, glucagon, bone growth factor-2, bone 
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growth factor- 7, TSH-0, interleukin 1, interleukin 2, 
interleukin 3, interleukin 6, interleukin 11, interleukin 
12, CSF-granulocyte, CSF-macrophage, CSF-granulocyte/ 
macrophage, immunoglobulins, catalytic antibodies, protein 
5 kinase C, glucocerebrosidase , superoxide dismutase, tissue 
plasminogen activator, urokinase, antithrombin III, DNAse, 
of-galactosidase, tyrosine hydroxylase, blood clotting 
factors V, blood clotting factor VII, blood clotting 
factor VIII, blood clotting factor IX, blood clotting 
10 factor X, blood clotting factor XIII, apolipoprotein E or 
apolipoprotein A- I, globins, low density lipoprotein 
receptor, IL-2 receptor, IL-2 antagonists, alpha-1 anti- 
trypsin, immune response modifiers, and soluble CD4. 

Selectable Markers and a^p lif ieat-.inn 

15 The identification of the targeting event can be 

facilitated by the use of one or more selectable marker 
genes. These markers can be included in the targting 
construct or be present on different constructs. Select- 
able markers can be divided into two categories: posi- 

20 tively selectable and negatively selectable (in other 
words, markers for either positive selection or negative 
selection) . in positive selection, cells expressing the 
positively selectable marker are capable of surviving 
treatment with a selective agent (such as neo, xanthine- 

25 guanine phosphoribosyl transferase (gpt) , dhfr, adenosine 
deaminase (ada) , puromycin (pac) , hygromycin (hyg) , CAD 
which encodes carbamyl phosphate synthase, aspartate 
transcarbamylase, and dihydro-orotase glutamine synthetase 
(GS) , multidrug resistance 1 (mdrl) and histidine D 

30 (hisD) , allowing for the selection of cells in which the 
targeting construct integrated into the host cell genome. 
In negative selection, cells expressing the negatively 
selectable marker are destroyed in the presence of the 
selective agent. The identification of the targeting 
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event can be facilitated by the use of one or more marker 
genes exhibiting the property of negative selection, such 
that the negatively selectable marker is linked to the 
exogenous DNA, but configured such that the negatively 
5 selectable marker flanks the targeting sequence, and such 
that a correct homologous recombination event with se- 
quences in the host cell genome does not result in the 
stable integration of the negatively selectable marker 
(Mansour, S.L. ej: fil., U^liZS. 11£ :348-352 (1988)). Mark- 
10 ers useful for this purpose include the Herpes Simplex 

Virus thymidine kinase (TK) gene or the bacterial opt 

gene. 

A variety of selectable markers can be incorporated 
into primary, secondary or immortalized cells. For exam- 
15 pie, a selectable marker which confers a selectable pheno- 
type such as drug resistance, nutritional auxotrophy, 
resistance to a cytotoxic agent or expression of a surface 
protein, can be used. Selectable marker genes which can 
be used include neo, gpt, dhfr, ada, pac, hyg, CAD, GS, 
20 mdrl and hisD. The selectable phenotype conferred makes 
it possible to identify and isolate recipient cells. 

Amplifiable genes encoding selectable markers (e.g., 
ada, GS, dhfr and the multifunctional CAD gene) have the 
added characteristic that they enable the selection of 
cells containing amplified copies of the selectable marker 
inserted into the genome. This feature provides a mecha- 
nism for significantly increasing the copy number of an 
adjacent or linked gene for which amplification is desir- 
able. Mutated versions of these sequences showing im- 
proved selection properties and other amplifiable sequenc- 
es can also be used. 

The order of components in the DNA construct can 
vary. Where the construct is a circular plasmid, the 
order of elements in the resulting structure can be: 
targeting sequence - plasmid DNA (comprised of sequences 
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used for the selection and/or replication of the targeting 
plasmid in a microbial or other suitable host) - select- 
able marker (s) - regulatory sequence - exon - splice-donor 
site. Preferably, the plasmid containing the targeting 
5 sequence and exogenous DNA elements is cleaved with a 
restriction enzyme that cuts one or more times within the 
targeting sequence to create a linear or gapped molecule 
prior to introduction into a recipient cell, such that the 
free DNA ends increase the frequency of the desired homol- 
10 ogous recombination event as described herein, in addi- 
tion, the free DNA ends may be treated with an exonuclease 
to create protruding 5' or 3' overhanging single-stranded 
DNA ends to increase the frequency of the desired homolo- 
gous recombination event. In this embodiment, homologous 
15 recombination between the targeting sequence and the 

cellular target will result in two copies of the targeting 
sequences, flanking the elements contained within the 
introduced plasmid. 

Where the construct is linear, the order can be, for 
20 example: a first targeting sequence - selectable marker - 
regulatory sequence - an exon - a splice-donor site - a 
second targeting sequence or, in the alternative, a first 
targeting sequence - regulatory sequence - an exon - a 
splice-donor site - DNA encoding a selectable marker - a 
25 second targeting sequence. Cells that stably integrate 
the construct will survive treatment with the selective 
agent; a subset of the stably transfected cells will be 
homologously recombinant cells. The homologously recombi- 
nant cells can be identified by a variety of techniques, 
30 including PCR, Southern hybridization and phenotypic 
screening. 

In another embodiment, the order of the construct can 
be: a first targeting sequence - selectable marker - 
regulatory sequence - an exon - a splice-donor site - an 
35 intron - a splice -acceptor site - a second targeting sequence. 
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Alternatively, the order of components in the DNA 
construct can be, for example: a first targeting sequence 
- selectable marker 1 - regulatory sequence - an exon - a 
splice-donor site - a second targeting sequence - select - 
5 able marker 2, or, alternatively, a first targeting se- 
quence - regulatory sequence - an exon - a splice-donor 
site - selectable marker 1 - a second targeting sequence - 
selectable marker 2. m this embodiment selectable marker 
2 displays the property of negative selection. That is 
10 the gene product of selectable marker 2 can be selected' 
against by growth in an appropriate media formulation 
containing an agent (typically a drug or metabolite ana- 
log) which kills cells expressing selectable marker 2 
Recombination between the targeting sequences flanking 
selectable marker 1 with homologous sequences in the host 
cell genome results in the targeted integration of select- 
able marker 1, while selectable marker 2 is not integrat- 
ed, such recombination events generate cells which are 
stably transfected with selectable marker 1 but not stably 
transfected with selectable marker 2, and such cells can 
be selected for by growth in the media containing the 
selective agent which selects for selectable marker 1 and 
the selective agent which selects against selectable 
marker 2. 

The DNA construct also can include a positively 
selectable marker that allows for the selection of cells 
containing amplified copies of that marker. The amplifi- 
cation of such a marker results in the co-amplification of 
flanking DNA sequences. In this embodiment, the order of 
construct components is, for example: a first targeting 
sequence - an amplifiable positively selectable marker - a 
second selectable marker (optional) - regulatory 
sequence - an exon - a splice-donor site - a second tar- 
geting DNA sequence. 
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In this embodiment, the activated gene can be further 
amplified by the inclusion of a selectable marker gene 
which has the property that cells containing amplified 
copies of the selectable marker gene can be selected for 
by culturing the cells in the presence of the appropriate 
selectable agent. The activated endogenous gene will be 
amplified in tandem with the amplified selectable marker 
gene. Cells containing many copies of the activated 
endogenous gene may produce very high levels of the de- 
sired protein and are useful for is vitro protein produc- 
tion and gene therapy. 

In any embodiment, the selectable and amplifiable 
marker genes do not have to lie immediately adjacent to 
each other. 

15 Optionally, the DNA construct can include a bacterial 

origin of replication and bacterial antibiotic resistance 
markers or other selectable markers, which allow for 
large-scale plasmid propagation in bacteria or any other 
suitable cloning/host system. A DNA construct which 
includes DNA encoding a selectable marker, along with 
additional sequences, such as a promoter, and splice 
junctions, can be used to confer a selectable phenotype 
upon transfected cells (e.g., plasmid pcDNEO, schematical- 
ly represented in Figure 4) . Such a DNA construct can be 
co- transfected into primary or secondary cells, along with 
a targeting DNA sequence, using methods described herein. 

Transfection and Homolnam,. Rsgojabjja tian 

According to the present method, the construct is 
introduced into the cell, such as a primary, secondary, or 
immortalized cell,' as a single DNA construct, or as sepa- 
rate DNA sequences which 'become incorporated into the 
chromosomal or nuclear DNA of a transfected cell. 

The targeting DNA construct, including the targeting 
sequences, regulatory sequence, an exon, a splice-donor 
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fragments can undergo homologous recombination to form a 
single fragment with the first and second targeting se- 
quences flanking the region of overlap between the two 
original fragments. The product fragment is then in a 
5 form suitable for homologous recombination with the 

cellular target sequences. More than two fragments can be 
used, designed such that they will undergo homologous 
recombination with each other to ultimately form a product 
suitable for homologous recombination with the cellular 
10 target sequences as described above. 

The Homoloaouslv Recombinan t Cells ' 

The targeting event results in the insertion of the 
regulatory sequence of the targeting construct, placing 
the endogenous gene under their control (for example, by 
15 insertion of either a promoter or an enhancer, or both, 
upstream of the endogenous gene or regulatory region) . 
Optionally, the targeting event can simultaneously result 
in the deletion of the endogenous regulatory element, such 
as the deletion of a tissue -specific negative regulatory 
20 element. The targeting event can replace an existing 
element; for example, a tissue- specific enhancer can be 
replaced by an enhancer that has broader or different 
cell-type specificity than the naturally-occurring ele- 
ments, or displays a pattern of regulation or induction 
25 that is different from the corresponding nontransfected 
cell. In this embodiment the naturally occurring sequenc- 
es are deleted and new sequences are added. Alternative- 
ly, the endogenous regulatory elements are not removed or 
replaced but are disrupted of disabled by the targeting 
30 event, such as by targeting the exogenous sequences within 
the endogenous regulatory elements . 

After the DNA is introduced into the cell, the cell 
is maintained under conditions appropriate for homologous 
recombination to occur between the genomic DNA and a 
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portion of the introduced DNA, as is known in the art 
(Capecchi, M.R., Science, 244:1288-1292 (1989)). 

Homologous recombination between the genomic DNA and 
the introduced DNA results in a homologously recombinant 
5 cell, such as a fungal, plant or animal, and particularly, 
primary, secondary, or immortalized human or other mamma- 
lian cell in which sequences which alter the expression of 
an endogenous gene are operatively linked to an endogenous 
gene encoding a product, producing a new transcription 
10 unit with expression and/or coding potential that is 

different from that of the endogenous gene. Particularly, 
the invention includes a homologously recombinant cell 
comprising regulatory sequences and an exon, flanked by a 
splice-donor site, which are introduced at a predetermined 
15 site by a targeting DNA construct, and are operatively 
linked to the second exon of an endogenous gene. Option- 
ally, there may be multiple exogenous exons (coding or 
non-coding) and introns operatively linked to any exon of 
the endogenous gene. The resulting homologously recorabi- 
20 nant cells are cultured under conditions which select for 
amplification, if appropriate, of the DNA encoding the 
amplifiable marker and the novel transcriptional unit 
With or without amplification, cells produced by this 
method can be cultured under conditions, as are known in 
•5 the art, suitable for the expression of the protein, 

thereby producing the protein in xllZB, or the cells can 
be used for in vivo delivery of a therapeutic protein 
(i.e., gene therapy). 

As used herein, the term primary cell includes cells 
present in a suspension of cells isolated from a verte- 
brate tissue source (prior to their being plated, i.e., 
attached to a tissue culture substrate such as a dish or 
flask), cells present in an explant derived from tissue, 
both of the previous types of cells plated for the first 
time, and cell suspensions derived from these plated 
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cells. The term secondary cell or cell strain refers to 
cells at all subsequent steps in culturing. That is, the 
first time a plated primary cell is removed from the 
culture substrate and replated (passaged), it is referred 
5 to herein as a secondary cell, as are all cells in subse- 
quent passages. Secondary cells are cell strains which 
consist of secondary cells which have been passaged one or 
more times. A cell strain consists of secondary cells 
that: 1) have been passaged one or more times; 2) exhibit 
10 a finite number of mean population doublings in culture; 
3) exhibit the properties of contact -inhibited, anchorage 
dependent growth (anchorage -dependence does not apply to 
cells that are propagated in suspension culture) ; and 4) 
are not immortalized. 
15 Immortalized cells are cell lines (as opposed to cell 

strains with the designation "strain" reserved for primary 
and secondary cells) , a critical feature of which is that 
they exhibit an apparently unlimited lifespan in culture. 
Cells selected for the subject method can fall into 
20 four types or categories: 1) cells which do not, as ob- 
tained, make or contain the protein or product (such as a 
protein that is not normally expressed by the cell or a 
fusion protein not normally found in nature) , 2) cells 
which make or contain the protein or product but in quan- 
25 tities other than that desired (such as, in quantities 
less than the physiologically normal lower level for the 
cell as it is obtained), 3) cells which make the protein 
or product at physiologically normal levels for the cell 
as it is obtained, but are to be augmented or enhanced in 
30 their content or production, and 4) cells in which it is 
desirable to change the pattern of regulation or induction 
of a gene encoding a protein. 

Primary, secondary and immortalized cells to be 
transfected by the present method can be obtained from a 
35 variety of tissues and include all cell types which can be 
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maintained in culture. For example, primary and secondary 
cells which can be transfected by the present method 
include fibroblasts, keratinocytes, epithelial cells 
(e.g., mammary epithelial cells, intestinal epithelial 
5 cells) , endothelial cells, glial cells, neural cells, 
formed elements of the blood (e.g., lymphocytes, bone 
marrow cells) , muscle cells and precursors of these somat- 
ic cell types. Where the horaologously recombinant cells 
are to be used in gene therapy, primary cells are prefera- 
10 bly obtained from the individual to whom the transfected 
primary or secondary cells are administered. However, 
primary cells can be obtained from a donor (other than the 
recipient) of the same species. 

Horaologously recombinant immortalized cells can also 
15 be produced by the present method and used for either 

protein production or gene therapy. Examples of immortal- 
ized human cell lines useful for protein production or 
gene therapy by the present method include, but are not 
limited to, HT1080 cells (ATCC CCL 121) , HeLa cells and 
derivatives of HeLa cells (ATCC CCL 2, 2.1 and 2.2), MCF-7 
breast cancer cells (ATCC BTH 22) , K-562 leukemia cells 
(ATCC CCL 243) , KB carcinoma cells (ATCC CCL 17) , 2780AD 
ovarian carcinoma cells (Van der Blick, A.M. ££. 
Cancer Res, 4fl ; 5927-5932 (1988), Raji cells (ATCC CCL 88) , 
25 Jurkat cells (ATCC TIB 152) , Namalwa cells (ATCC CRL 

1432), HL-60 cells (ATCC CCL 240), Daudi cells (ATCC CCL 
213), RPMI 8226 cells (ATCC CCL 155), U-937 cells (ATCC 
CRL 1593), Bowes Melanoma cells (ATCC CRL 9607) , WI-38VA13 
subline 2R4 cells (ATCC CLL 75.1), and MOLT-4 cells (ATCC 
CRL 1582), as well as heterohybridoma cells produced by 
fusion of human cells and cells of another species. 
Secondary human fibroblast strains, such as Wl-38 (ATCC 
CCL 75) and MRC-5 (ATCC CCL 171) may be used. In addi- 
tion, primary, secondary, or immortalized human cells, as 
well as primary, secondary, or immortalized cells from 
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other species which display the properties of gene ampli- 
fication in vitro can be used for in vitro protein produc- 
tion or gene therapy. 

Method of Converting a Gene into a cDNA Copy 
5 The present invention also relates to a method by 

which homologous recombination is used to convert a gene 
into a cDNA copy (a gene copy devoid of introns) . The 
cDNA copy can be transferred into yeast or bacteria for in 
yi£ro protein production, or the cDNA copy can be inserted 
10 into a mammalian cell for in vitro or in vivo protein 

production. If the cDNA is to be transferred to microbial 
cells, two DNA constructs containing targeting sequences 
are introduced by homologous recombination, one construct 
upstream of and one construct downstream of a human gene 
15 encoding a therapeutic protein. For example, the sequenc- 
es introduced upstream include DNA sequences homologous to 
genomic DNA sequences at or upstream of the DNA encoding 
the first amino acid of a mature, processed therapeutic 
protein; a retroviral long term repeat (LTR) ; sequences 
20 encoding a marker for selection in microbial cells; a 

regulatory element that functions in microbial cells; and 
DNA encoding a leader peptide that promotes secretion from 
microbial cells with a splice-donor site. The sequences 
introduced upstream are introduced near to and upstream of 
25 genomic DNA encoding the first amino acid of a mature, 
processed therapeutic protein. The sequences introduced 
downstream include DNA sequences homologous to genomic DNA 
sequences at or downstream of the DNA encoding the last 
amino acid of a mature, processed protein; a microbial 
30 transcriptional termination sequence; sequences capable of 
directing DNA replication in microbial cells; and a retro- 
viral LTR. The sequences introduced downstream are intro- 
duced adjacent to and downstream of the DNA encoding the 
stop codon of the mature, processed therapeutic protein. 
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After the two DNA constructs are introduced into cells, 
the resulting cells are maintained under conditions appro- 
priate for homologous recombination between the introduced 
DNA and genomic DNA, thereby producing homologously recom- 
5 binant cells. Optionally, one or both of the DNA con- 
structs can encode one or more markers for either positive 
or negative selection of cells containing the DNA con- 
struct, and a selection step can be added to the method 
after one or both of the DNA constructs have been intro- 
10 duced into the cells. Alternatively, the sequences encod- 
ing the marker for selection in microbial cells and the 
sequences capable of directing DNA replication in microbi- 
al cells can both be present in either the upstream or the 
downstream targeting construct, or the marker for selec- 
15 tion in microbial cells can be present in the downstream 
targeting construct and the sequences capable of directing 
DNA replication in microbial cells can be present in the 
upstream targeting construct. The homologously recombi- 
nant cells are then cultured under conditions appropriate 
for LTR directed transcription, processing and reverse 
transcription of the RNA product of the gene encoding the 
therapeutic protein. The product of reverse transcription 
is a DNA construct comprising an intronless DNA copy 
encoding the therapeutic protein, operatively linked to 
25 DNA sequences comprising the two exogenous DNA constructs 
described above. The intronless DNA construct produced by 
the present method is then introduced into a microbial 
cell. The microbial cell is then cultured under condi- 
tions appropriate for expression and secretion of the 
30 therapeutic protein. 

In Vivo Prof Pin Production' 

Homologously recombinant cells of the present inven- 
tion are useful, as populations of homologously recombi- 
nant cell lines, as populations of homologously recombi- 
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nant primary or secondary cells, homologously recombinant 
clonal cell strains or lines, homologously recombinant 
heterogenous cell strains or lines, and as cell mixtures 
in which at least one representative cell of one of the 
5 four preceding categories of homologously recombinant 
cells is present. Such cells may be used in a delivery 
system for treating an individual with an abnormal or 
undesirable condition which responds to delivery of a 
therapeutic product, which is either: 1) a therapeutic 
10 protein (e.g., a protein which is absent, underproduced 
relative to the individual's physiologic needs, defective 
or inefficiently or inappropriately utilized in the indi- 
vidual; a protein with novel functions, such as enzymatic 
or transport functions) or 2) a therapeutic nucleic acid 
15 (e.g., rha which inhibits gene expression or has intrinsic 
enzymatic activity) . In the method of the present inven- 
tion of providing a therapeutic protein or nucleic acid, 
homologously recombinant primary cells, clonal cell 
strains or heterogenous cell strains are administered to 
an individual in whom the abnormal or undesirable condi- 
tion is to be treated or prevented, in sufficient quantity 
and by an appropriate route, to express or make available 
the protein or exogenous DNA at physiologically relevant 
levels. A physiologically relevant level is one which 
either approximates the level at which the product is 
normally produced in the body or results in improvement of 
the abnormal or undesirable condition. According to an 
embodiment of the invention described herein, the homo- 
logously recombinant immortalized cell lines to be admin- 
istered can be enclosed in one or more semipermeable 
barrier devices. The permeability properties of the 
device are such that the 'cells are prevented from leaving 
the device upon implantation into an animal, but the 
therapeutic product is freely permeable and can leave the 
barrier device and enter the local space surrounding the 
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implant or enter the systemic circulation. For example, 
hGH, hEPO, human insulinotropin, hGM-CSF, hG-CSF, human a- 
interferon, or human FSH/J can be delivered systemically in 
humans for therapeutic benefits. 
5 Barrier devices are particularly useful and allow 

homologously recombinant immortalized cells, homologously 
recombinant cells from another species (homologously 
recombinant xenogeneic cells) , or cells from a nonhisto- 
compatibility-matched donor (homologously recombinant 
10 allogeneic cells) to be implanted for treatment of human 
or animal conditions or for agricultural uses (i.e., meat 
and dairy production) . Barrier devices also allow conve- 
nient short-term (i.e., transient) therapy by providing 
ready access to the cells for removal when the treatment 
15 regimen is to be halted for any reason. 

A number of synthetic, semisynthetic, or natural 
filtration membranes can be used for this purpose, includ- 
ing, but not limited to, cellulose, cellulose acetate, 
nitrocellulose, polysulfone, polyvinylidene difluoride, 
20 polyvinyl chloride polymers and polymers of polyvinyl 

chloride derivatives. Barrier devices can be utilized to 
allow primary, secondary, or immortalized cells from 
another species to be used for gene therapy in humans. 

In Vitrr> grofcein Production 

25 Homologously recombinant cells from human or non- 

human species according to this invention can also be used 
for ia vj.t;;rp protein production. The cells are maintained 
under conditions, as are known in the art, which result in 
expression of the protein. Proteins expressed using the 

30 methods described may be purified from cell lysates or 
cell supernatants in order' to purify the desired protein. 
Proteins made according to this method include therapeutic 
proteins which can be delivered to a human or non-human 
animal by conventional pharmaceutical routes as is known 
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xn the art (e.g., oral, intravenous, intramuscular, intra- 
nasal or subcutaneous) . such proteins include hGH, hEPO 
and human insulinotropin, hGM-CSF, hG-CSF, FSH/J or a- 
interferon. These cells can be immortalized, primary, or 
secondary cells. The use of cells from other species may 
be desirable in cases where the non-human cells are advan- 
tageous for protein production purposes where the non- 
human protein is therapeutically or commercially useful 
for example, the use of cells derived from salmon for the 
production of salmon calcitonin, the use of cells derived 
from pigs for the production of porcine insulin, and the 
use of bovine cells for the production of bovine growth 
hormone . 

Advantage 

The methodologies, DMA constructs, cells, and resul- 
ting proteins of the invention herein possess versatility 
and many other advantages over processes currently em- 
ployed within the art in gene targeting. The ability to 
activate an endogenous gene by positioning an exogenous 
regulatory sequence at various positions ranging from 
immediately adjacent to the gene of interest (directly 
fused to the normal gene's transcribed region) to 30 
kxlobase pairs or further upstream of the transcribed 
regzon of an endogenous gene, or within an intron of an 
endogenous gene, is advantageous for gene expression in 
cells. For example, it can be employed to position the 
regulatory element upstream or downstream of regions that 
normally silence or negatively regulate a gene. The 
positioning of a regulatory element upstream or downstream 
of such a region can override such dominant negative 
effects that normally inhibit transcription, in addition 
regions of DNA that normally inhibit transcription or have 
an otherwise detrimental effect on the expression of a ' 
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gene may be deleted using the targeting constructs, des- 
cribed herein. 

Additionally, since promoter function is known to 
depend strongly on the local environment, a wide range of 
5 positions may be explored in order to find those local 
environments optimal for function. However, since, ATG 
start codons are found frequently within mammalian DNA 
(approximately one occurrence per 48 base pairs) , tran- 
scription cannot simply initiate at any position upstream 
10 of a gene and produce a transcript containing a long 
leader sequence preceding the correct ATG start codon, 
since the frequent occurrence of ATG codons in such a 
leader sequence will prevent translation of the correct 
gene product and render the message useless. Thus, the 
15 incorporation of an exogenous exon, a splice-donor site, 
and, optionally, an intron and a splice-acceptor site into 
targeting constructs comprising a regulatory region allows 
gene expression to be optimized by identifying the optimal 
site for regulatory region function, without the limita- 
tion imposed by needing to avoid inappropriate ATG start 
codons in the mRNA produced. This provides significantly 
increased flexibility in the placement of the construct " 
and makes it possible to activate a wider range of genes. 
The DNA constructs of the present invention are also 
25 useful, for example, in processes for making fusion pro- 
teins encoded by recombinant, or exogenous, sequences and 
endogenous sequences. 

Gene targeting and amplification as disclosed above 
are particularly useful for altering on the expression of 
30 genes which form transcription units which are sufficient- 
ly large that they are difficult to isolate and express, 
or for turning on genes for which the entire protein 
coding region is unavailable or has not been cloned. 
Thus, the DNA constructs described above are useful for 
35 operatively linking exogenous regulatory elements to 
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endogenous genes in a way that precisely defines the 
transcriptional unit, provides flexibility in the relative 
positioning of exogeneous regulatory elements and endoge- 
nous genes ultimately, enables a highly controlled system 
for obtaining and regulating expression of genes of thera- 
peutic interest. 

Explanation of the Rv^i^ 

As described herein, Applicants have demonstrated 
that DNA can be introduced into cells, such as primary, 
secondary or immortalized vertebrate cells and integrated 
into the genome of the transfected cells by homologous 
recombination. They have further demonstrated that the 
exogenous DNA. has the desired function in the homologously 
recombinant (HR) cells and that correctly targeted cells 
15 can be identified on the basis of a detectable phenotype 
conferred by the properly targeted DNA. 

Applicants describe construction of a plasraid useful 
for targeting to a particular locus (the HPRT locus) in 
the human genome and selection based upon a drug resistant 
20 phenotype (Example la) . This plasmid is designated pE3Neo 
and its integration into the cellular genome at the HPRT 
locus produces cells which have an hprt", 6-TG resistant 
phenotype and are also G418 resistant. As described, they 
have shown that pE3Neo functions properly in gene target- 
25 ing in an established human fibroblast cell line (Example 
lb) , by demonstrating localization of the DNA introduced 
mto established cells within exon 3 of the HPRT gene. 

In addition, Applicants demonstrate gene targeting in 
primary and secondary human skin fibroblasts using pE3Neo 
30 (Example 1c) . The subject application further demon- 
strates that modification -of DNA termini enhances target- 
ing of DNA into genomic DNA (Examples lc and le) . 
Applicants also describe methods by which a gene can be ' 
inserted at a preselected site in the genome of a cell, 
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such as a primary, secondary, or immortalized cell by gene 
targeting (Example id) . 

In addition, the present invention relates to a 
method of protein production using transfected cells. The 
5 method involves transfecting cells, such as primary cells, 
secondary cells or immortalized cells, with exogenous DNA 
which encodes a therapeutic product or with DNA which is 
sufficient to target to an endogenous gene which encodes a 
therapeutic product. For example, Examples lg, lh, lj, 
10 lk, 2, 3, 4 and 6-9 describe protein production by 

targeting of a selected endogenous gene with DNA sequence 
elements which will alter the expression of the endogenous 
gene. 

Applicants also describe DNA constructs and methods 
15 for amplifying an endogenous cellular gene that has been 
activated by gene targeting (Examples 3, 6, 8 and 9) . 

Examples lf-ih, 2, 4 and 6 illustrate embodiments in 
which the normal regulatory sequences upstream of the 
human EPO gene are altered to allow expression of hEPO in 
primary or secondary fibroblast strains which do not 
express EPO in detectable quantities in their untrans- 
fected state. In one embodiment the product of targeting 
leaves the normal EPO protein intact, but under the con- 
trol of the mouse metallothionein promoter. Examples li 
5 and lj demonstrate the use of similar targeting constructs 
to activate the endogenous growth hormone gene in primary 
or secondary human fibroblasts. In other embodiments 
described for activating EPO expression in human fibro- 
blasts, the products of targeting events are chimeric 
• transcription units, in which the first exon of the human 
growth hormone gene is positioned upstream of EPO exons 2- 
5. The product of transcription (controlled by the mouse 
metallothionein promoter) , splicing, and translation is a 
protein in which amino acids 1-4 of the hEPO signal pep- 
tide are replaced with amino acid residues 1-3 of hGH. 
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The chimeric portion of this protein, the signal peptide, 
is removed prior to secretion from cells. Example 5 
describes targeting constructs and methods for producing 
cells which will convert a gene (with introns) into an 
5 expressible cDNA copy of that gene (without introns) and 
the recovery of such expressible cDNA molecules in micro- 
bial (e.g., yeast or bacterial) cells. Example 6 de- 
scribes construction of a targeting vector, designated 
PREP04 for dual selection and selection of cells in which 
10 the dhfr gene is amplified. Plasmid pREP04 has been used 
to amplify the human EPO (hEPO) locus in HT1080 cells (an 
immortalized human cell line) after activation of the 
endogenous hEPO gene by homologous recombination. As 
described, stepwise selection in methotrexate -containing 
media resulted in a 70 -fold increase in hEPO production in 
cells resistant to 0.4 nM methotrexate. 

Examples 7 and 8 describe methods for inserting a 
regulatory sequence upstream of the normal EPO promoter 
and methods for EPO production using such a construct. In 
addition, Example 8 describes the amplification of a 
targeted EPO gene produced by the method of Example 7. 
Example 9 describes methods for targeting the human a- 
interferon, GM-CSF, G-CSF, and FSH0 genes to create cells 
useful for in protein production. 

The Examples provide methods for activating or for 
activating and amplifying endogenous genes by gene target- 
ing which do not require manipulation or other uses of the 
target genes' protein coding regions. Using the methods 
and DNA constructs or plasmids taught herein or modifica- 
tions thereof which are apparent to one of ordinary skill 
in the art, gene expression can be altered in cells that 
have properties desirable 'for in vitro protein production 
(e.g., pharmaceutics) or in vjvp protein delivery methods 
(e.g. gene therapy). Figures 5 and 6 illustrate two 
strategies for transcriptionally activating the hEPO gene. 
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Using the methods and DNA constructs or plasmids 
taught herein or modifications thereof which are apparent 
to one of ordinary skill in the art, exogenous DNA which 
encodes a therapeutic product (e.g., protein, ribozyme, 
anti-sense RNA) can be inserted at preselected sites in 
the genome of vertebrate (e.g., mammalian, both human and 
nonhuman) primary or secondary cells. 

The present invention will now be illustrated by the 
following examples, which are not intended to be limiting 
in any way. 
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EXAMPLES 

EXAMPLE L PRODUCTION OF TRANfiPPr"r p,D CELL STRAINS BY G™* 
TARGETING 

Gene targeting occurs when transfecting DNA either 
5 integrates into or partially replaces chromosomal DNA 
sequences through a homologous recombinant event. While 
such events can occur in the course of any given transfec- 
tion experiment, they are usually masked by a vast excess 
of events in which plasmid DNA integrates by nonhomolo- 
10 gous, or illegitimate, recombination. 
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NATION OP & mwOTPTTCT TTfiPPTTT, P QR SELRPTTOW ni? 
, GENE TARGETING RVPW TS IN H^ Maw rpr.T.c 
One approach to selecting the targeted events is by 
genetic selection for the loss of a gene function due to 
15 the integration of transfecting DNA. The human HPRT locus 
encodes the enzyme hypoxanthine-phosphoribosyl transfer- 
ase, hprt" cells can be selected for by growth in medium 
containing the nucleoside analog 6-thioguanine (6-TG) : 
cells with the wild- type (HPRT+) allele are killed by 
6-TG, while cells with mutant (hprt") alleles can survive. 
Cells harboring targeted events which disrupt HPRT gene 
function are therefore selectable in 6-TG medium. 

To construct a plasmid for targeting to the HPRT 
locus, the 6.9 kb Hindlll fragment extending from posi- 
tions 11,960-18,869 in the HPRT sequence (Genebank name 
HOMHPRTB ; Edwards, A. al. , Genomics £:593-608 (1990)) 
and including exons 2 and 3 of the HPRT gene, is subcloned 
into the Hindlll site of pUC12. The resulting clone is 
cleaved at the unique Xhol site in exon 3 of the HPRT gene 
fragment and the ill kb Sall-Xhol fragment containing the 
neo gene from pMClNeo (Stratagene) is inserted, disrupting 
the coding sequence of exon 3. One orientation, with the 
direction of neo transcription opposite that of HPRT 
transcription was chosen and designated pE3Neo. The 
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replaceraent of the normal HPRT exon 3 with the neo-disrup- 
ted version will result in an hprt", 6-TG resistant pheno- 
type. Such cells will also be G418 resistant. 

b. GENE TARGETING IN AN ESTABLISHED HTI M AN FIBBOBTAfiT 
5 CELL LINE 

As a demonstration of targeting in immortalized cell 
lines, and to establish that pE3Neo functions properly in 
gene targeting, the human fibrosarcoma cell line HT1080 
(ATCC CCL 121) was transfected with pE3Neo by electropora- 
10 tion. 

HT1080 cells were maintained in HAT (hypoxanthine/ 
aminopterin/xanthine) supplemented DMEM with 15% calf 
serum (Hyclone) prior to electroporation. Two days before 
electroporation, the cells are switched to the same medium 
15 without aminopterin. Exponentially growing cells were 
trypsinized and diluted in DMEM/15% calf serum, centri- 
fuged, and resuspended in PBS (phosphate buffered saline) 
at a final cell volume of 13.3 million cells per ml. 
pE3Neo is digested with HindHI, separating the 8 kb 
20 HPRT-neo fragment from the pUC12 backbone, purified by 
phenol extraction and ethanol precipitation, and resus- 
pended at a concentration of 600 iig/uCL. 50 /il (30 fig) was 
added to the electroporation cuvette (0.4 cm electrode 
gap; Bio-Rad Laboratories) , along with 750 fil of the cell 
25 suspension (10 million cells). Electroporation was at 450 
volts, 250 /iFarads (Bio-Rad Gene Pulser; Bio-Rad Laborato- 
ries) . The contents of the cuvette were immediately added 
to DMEM with 15% calf serum to yield a cell suspension of 
1 million cells per 25 ml media. 25 ml of the treated 
30 cell suspension was plated onto 150 mm diameter tissue 
culture dishes and incubated at 37°C, 5% C0 2 . 24 hrs 
later, a G418 solution was added directly to the plates to 
yield a final concentration of 800 fig/ml G418. Five days 
later the media was replaced with DMEM/15% calf serum/ 
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800 M g/ml G418. Nine days after electroporation, the 
media was replaced with DMEM/15% calf serum/800 Azg/ml G418 
and 10 mm 6-thioguanine. Colonies resistant to G418 and 
6-TG were picked using cloning cylinders 14-16 days after 
the dual selection was initiated. 

The results of five representative targeting experi- 
ments in HT1080 cells are shown in Table 1. 

TABLE 1 

Transfection Number of Number of G418 r 

Transfection Treated Cells 6-TG r Clones 



1 1 x 10' 3 2 

2 1 x 10' 



28 
24 
32 

5 1 x 10' 66 



3 1 x 10' 

4 1 X 10' 



For transfection 5, control plates designed to deter- 
mine the overall yield of G418* colonies indicated that 
33,700 G418' colonies could be generated from the initial 
1 x 10' treated cells. Thus, the ratio of targeted to 
non-targeted events is 66/33,700, or 1 to 510. In the 
five experiments combined, targeted events arise at a 
frequency of 3.6 x 10«, or 0.00036* of treated cells. 

Restriction enzyme and Southern hybridization experi- 
ments using probes derived from the neo and HPRT genes 
localized the neo gene to the HPRT locus at the predicted 
site within HPRT exon 3. 
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C - PENE TARGETING 'IW PRT MARY AWn SECONDARY WTTMAM flTTTM 
FIBROBLASTS 

pE3Neo is digested with Hindlll, separating the 8 kb 
HPRT-neo fragment from the pUC12 backbone, and purified by 
phenol extraction and ethanol precipitation. DNA was 
resuspended at 2 mg/ml. Three million secondary human 
foreskin fibroblasts cells in a volume of 0.5 ml were 
electroporated at 250 volts and 960 /^Farads, with 100 jig 
of Hindlll pE3Neo (50 fil) . Three separate transfections 
were performed, for a total of 9 million treated cells. 
Cells are processed and selected for G418 resistance. 
500,000 cells per 150 mm culture dish were plated for G418 
selection. After 10 days under selection, the culture 
medium is replaced with human fibroblast nutrient medium 
15 containing 400 /tg/ml G418 and 10 /xM 6-TG. Selection with 
the two drug combination is continued for 10 additional 
days. Plates are scanned microscopically to localize 
human fibroblast colonies resistant to both drugs. The 
fraction of G418 r t-TG r colonies is 4 per 9 million treat - 
20 ed cells. These colonies constitute 0.0001% (or 1 in a 
million) of all cells capable of forming colonies. Con- 
trol plates designed to determine the overall yield of 
G418* colonies indicated that 2,850 G418 r colonies could 
be generated from the initial 9 x 10 s treated cells. 
Thus, the ratio of targeted to non- targeted events is 
4/2,850, or 1 to 712. Restriction enzyme and Southern 
hybridization experiments using probes derived from the 
neo and HPRT genes were used to localize the neo gene to 
the HPRT locus at the predicted site within HPRT exon 3 
and demonstrate that targeting had occurred in these four 
clonal cell strains'. Colonies resistant to both drugs 
have also been isolated by transfecting primary cells 
(1/3.0 x 10 7 ) . 

The results of several pE3Neo targeting experiments 
are summarized in Table 2. Hindlll digested pE3Neo was 
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either transfected directly or treated with exonuclease 
III to generate 5' single -stranded overhangs prior to 
transfection (see Example lc) . DNA preparations with 
single-stranded regions ranging from 175 to 930 base pairs 
. m length were tested. Using pE3neo digested with Hindlll 
alone, 1/799 G418-resistant colonies were identified by 
restriction enzyme and Southern hybridization analysis as 
having a targeted insertion of the neo gene at the HPRT 
locus (a total of 24 targeted clones were isolated) . 
Targeting was maximally stimulated (approximately 10-fold 
stimulation) when overhangs of 175 bp were used, with 1/80 
G418* colonies displaying restriction fragments that are 
diagnostic for targeting at HPRT (a total of 9 targeted 
clones were isolated) . Thus, using the conditions and 
recombinant DNA constructs described here, targeting is 
readily observed in normal human fibroblasts and the 
overall targeting frequency (the number of targeted clones 
divided by the total number of clones stably transfected 
to G4l8-resistance) can be stimulated by transfection with 
targeting constructs containing single-stranded overhang- 
ing tails, by the method as described in Example le. 

TABLE 2 

TARGETS TO THE HPRT T^OT9 T» HT7MRM m^y ,^ 
T?eatSLnt- /umber of Number Targeted Total Number of 

Hindlll digest 6 1/799 2 4 

175 bp overhang 1 1/80 9 

350 bp overhang 3 i/n 7 2Q 

930 bp overhang 1 1/144 x 
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d - GENERATION OF A' CONSTRUCT FOR TAB B ED TWSBPTTnw pp fl 
GENE OF THERAPEI7TT C INTEREST INTO THE HUMAN BRwnjup 
AND ITS TT SE IN GENE TARGETTKW 

A variant of pE3Neo, in which a gene of therapeutic 
5 interest is inserted within the HPRT coding region, adja- 
cent to or near the neo gene, can be used to target a gene 
of therapeutic interest to a specific position in a recip- 
ient primary or secondary cell genome. Such a variant of 
pE3Neo can be constructed for targeting the hGH gene to 
10 the HPRT locus. 

pXGH5 (schematically presented in Figure 3) is di- 
gested with EcoRI and the 4.1 kb fragment containing the 
hGH gene and linked mouse metallothionein (mMT) promoter 
is isolated. The EcoRI overhangs are filled in with the 
Klenow fragment from £. coli DNA polymerase. Separately, 
pE3Neo is digested with Xhol, which cuts at the junction 
of the neo fragment and HPRT exon 3 (the 3' junction of 
the insertion into exon 3) . The Xhol overhanging ends of 
the linearized plasmid are filled in with the Klenow 
fragment from £. coli DNA polymerase, and the resulting 
fragment is ligated to the 4.1 kb blunt -ended hGH-mMT 
fragment. Bacterial colonies derived from the ligation 
mixture are screened by restriction enzyme analysis for a 
single copy insertion of the hGH-mMT fragment and one 
orientation, the hGH gene transcribed in the same direc- 
tion as the neo gene, is chosen and designated pE3Neo/hGH. 
pE3Neo/hGH is digested with Hindlll, releasing the 12.1 kb 
fragment containing HPRT, neo and mMT-hGH sequences. 
Digested DNA is treated and transfected into primary or 
secondary human fibroblasts as described in Example lc. 
G418 r TG r colonies are selected and analyzed for targeted 
insertion of the mMT-hGH and neo sequences into the HPRT 
gene as described in Example lc. Individual colonies are 
assayed for hGH expression using a commercially available 
immunoassay (Nichols Institute) . 
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Secondary human fibroblasts were transfected with 
pE3Neo/hGH and thioguanine-resistant colonies were ana- 
lyzed for stable hGH expression and by restriction enzyme 
and Southern hybridization analysis. Of thirteen TG' 
colonies analyzed, eight colonies were identified with an 
insertion of the hGH gene into the endogenous HPRT locus. 
All eight strains stably expressed significant quantities 
of hGH, with an average expression level of 22.7 pg/io« 
cells/24 hours. Alternatively, plasmid P E3neoEP0, Figure 
4, may be used to target EPO to the human HPRT locus. 

The use of homologous recombination to target a gene 
of therapeutic interest to a specific position in a cell's 
genomic DNA can be expanded upon and made more useful for 
producing products for therapeutic purposes (e.g., pharma- 
ceutics, gene therapy) by the insertion of a gene through 
which cells containing amplified copies of the gene can be 
selected for by exposure of the cells to an appropriate 
drug selection regimen. For example, P E3neo/hGH (Example 
Id) can be modified by inserting the dfafr, ada, or CAD 
gene at a position immediately adjacent to the hGH or neo 
genes in P E3neo/hGH. Primary, secondary, or immortalized 
cells are transfected with such a plasmid and correctly 
targeted events are identified. These cells are further 
treated with increasing concentrations of drugs appropri- 
ate for the selection of cells containing amplified genes 
(for dhfr, the selective agent is methotrexate, for CAD 
the selective agent is N- (phosphonacetyl) -L-aspartate 
(PALA) , and for ada the selective agent is an adenine 
nucleoside (e.g., alanosine) . m this manner the integra- 
tion of the gene of therapeutic interest will be coampli- 
fied along with the gene for which amplified copies are 
selected. Thus, the genetic engineering of cells to 
produce genes for therapeutic uses can be readily con- 
trolled by preselecting the site at which the targeting' 
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construct integrates and at which the amplified copies 
reside in the amplified cells. 

e. MODIFICATION OP DTJ A TERM TNT TO ENHANCE TARGETING 

Several lines of evidence suggest that 3 ' -overhanging 
5 ends are involved in certain homologous recombination 
pathways of £. coli, bacteriophage, £. cerevislae and 
Xenopus laevis. In Xenopus laevis oocytes, molecules with 
3 '-overhanging ends of several hundred base pairs in 
length underwent recombination with similarly treated 
10 molecules much more rapidly after microinjection than 
molecules with very short overhangs (4 bp) generated by 
restriction enzyme digestion. In yeast, the generation of 
3 '-overhanging ends several hundred base pairs in length 
appears to be a rate limiting step in meiotic recombinati- 
15 on. No evidence for an involvement of 3 ' -overhanging ends 
in recombination in human cells has been reported, and in 
no case have modified DNA substrates of any sort been 
shown to promote targeting (one form of homologous recom- 
bination) in any species. The experiment described in the 
20 following example and Example lc suggests that 5' -over- 
hanging ends are effective for stimulating targeting in 
primary, secondary and immortalized human fibroblasts. 

There have been no reports on the enhancement of 
targeting by modifying the ends of the transfecting DNA 
25 molecules. This example serves to illustrate that modifi- 
cation of the ends of linear DNA molecules, by conversion 
of the molecules' termini from a double -stranded form to a 
single- stranded form, can stimulate targeting into the 
genome of primary and secondary human fibroblasts. 
10 lioo fig of plasmid pE3Neo (Example la) is digested 

with Hindlll. This DNA cati be used directly after phenol 
extraction and ethanol precipitation, or the 8 kb Hindlll 
fragment containing only HPRT and the neo gene can be 
separated away from the pUC12 vector sequences by gel 
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electrophoresis. Exolll digestion of the Hindlll digested 
2 -suits in extensive exonucleolytic digestion at each 
end, xnitxatxng at each free 3' end, and leaving 5<- 

5 °7T gin9 endS ' ^ 6Xtent ° f -onucleolytic action 
5 and, hence, the length of the resulting 5' -overhangs, can 
be controlled by varying the time of ExoIII digestion 
ExoIII digestion of loo ^ of Hindlll digested pE3Neo'xs 
carrxed out according to the supplier's recorded condi- 

10 L ; tlmeS ° f 30 86C ' 1 1 * 5 2 2.5 

mxn 3 mxn 3.5 min, 4 .in, 4.5 min, and 5 min. To moni- 
tor the extent of digestion an aliquot from each time . 
Point, containing 1 w of j^ouz treat ed DNA, is treated 
with mung bean nuclease (Promega) , under conditions recom- 
mended by the supplier, and the samples fractionated by 
L5 gel electrophoresis. The difference in si 2e between 
non-treated, Hindlll digested pE3Neo and the same mole- 
cules treated with ExoIII and mung bean nuclease is mea- 
sured. This size difference divided by two gives the 

0 m ir a9 n 6 len3th ° f ^ 5 '- 0veri **S « each end of the 
0 molecule. Using the time points described above and 

digestion at 30-, the S'-overhangs produced should range 

from 100 to 1,000 bases. 

" ? ° f 2X0111 treated » (total Hindlll digest of 
PEBNeo) from each time point is purified and electropor- 
ated xnto prxmary, secondary, or immortalized human fibro- 
blasts under the conditions described in Example ic. The 
degree to which targeting is enhanced by each ExoIII 

lTTl PrePaXatim 18 *Y "-ting the number 

of G418< 6-TG r colonies and comparing these numbers to 
targeting with Hindlll digested pE3Neo that was not treat- 
ed with ExoIII. 

The effect of 3 ■ -overhanging ends can also be quanti- 
sed usxng an analogous system. i„ this case Hindlll 
dxgested p E 3Neo is treated with bacteriophage T7 gene 6 ' 
exonuclease (United states Biochemicals, for varying time 
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intervals under the supplier' s recommended conditions. 
Determination of the extent of digestion (average length 
of 3 '-overhang produced per end) and electroporation 
conditions are as described for ExoIII treated DNA. The 
5 degree to which targeting is enhanced by each T7 gene 6 
exonuclease treated preparation is quantified by counting 
the number of (3418* 6-TG' colonies and comparing these 
numbers to targeting with Hindlll digested pE3Neo that was 
not treated with T7 gene 6 exonuclease. 
0 Other methods for generating 5' and 3' overhanging 

ends are possible, for example, denaturation and annealing 
of two linear molecules that partially overlap with each 
other will generate a mixture of molecules, each molecule 
having 3 '-overhangs at both ends or 5' -overhangs at both 
> ends, as well as reannealed fragments indistinguishable 
from the starting linear molecules. The length of the 
overhangs is determined by the length of DNA that is not 
in common between the two DNA fragments. 



CONSTRUCTION OF TARGETING PLARMT fiS FOR PLACTNn THF 
HUMAN ERYTHROPOIETIN QRN F, UNDER THF CONTROL OF THF 
MOUSE METALLOTHTONEIN PROMOTER IN PRTMA RY. SECONDARY 
AND IMMORTAT.T5ED HUMAN FIBROBLASTS 

The following serves to illustrate one embodiment of 
the present invention, in which the normal positive and 
negative regulatory sequences upstream of the human eryth- 
ropoietin (hEPO) gene are altered to allow expression of 
human erythropoietin in primary, secondary or immortalized 
human fibroblasts, which do not express hEPO in signifi- 
cant quantities as obtained. 

A region lying exclusively upstream of the human EPO 
coding region can be amplified by PCR. Three sets of 
primers useful for this purpose were designed after analy- 
sis of the published human EPO sequence [Genbank designa- 
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tion HUMERPA; Lin, F-K. , ^ ^. , Proc. Natl , Acad, ggj 
USA 82:7580-7584 (1985)) . These primer pairs can amplify 
fragments of 609, 603, or 590 bp. 

TABLE 3 

HUMERPA 

Primer Coordinate Sequence Fragment Size 



F1 2-20 5' AGCTTCTGGGCTTCCAGAC 

{SEQ ID NO 1) 

R2 610-595 5' GGGGTCCCTCAGCGAC 609 b D 

(SEQ ID NO 2) P 

F2 8 -» 24 5' TGGGCTTCCAGACCCAG 

(SEQ ID NO 3) 

R2 610 ■* 595 5' GGGGTCCCTCAGCGAC 603 bp 

F3 21 - 40 5' CCAGCTACTTTGCGGAACTC 

(SEQ ID NO 4} 

R2 610 - 595 5' GGGGTCCCTCAGCGAC 590 bp 

The three fragments overlap substantially and are 
interchangeable for the present purposes. The 609 bp 
fragment, extending from -623 to -14 relative to the 
translation start site (HUMERPA nucleotide positions 2 to 
610), is ligated at both ends with Clal linkers. The 
resulting Clal-linked fragment is digested with Clal and 
inserted into the Clal site of P BluescriptIISK/+ (strata- 
gene) , with the orientation such that HUMERPA nucleotide 
position 610 is adjacent to the Sail site in the plasmid 
polylinker). This plasmid, pS'EPO, can be cleaved, sepa- 
rately, at the unique Fspl or Sfil sites in the human EPO 
upstream fragment (HUMERPA nucleotide positions 150 and 
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405, respectively) and ligated to the mouse metallothion- 
ein promoter. Typically, the 1.8 kb EcoRI-Bglll from the 
mMT-I gene [containing no mMT coding sequences; Hamer, 
D.H. and Walling M. , J. Mol. AddI. Gen. 1:273 288 (1982); 
5 this fragment can also be isolated by known methods from 
mouse genomic DNA using PCR primers designed from analysis 
of mMT sequences available from Genbank; i.e., MUSMTI, 
MUSMTIP, MUSMTI PRM] is made blunt-ended by known methods 
and ligated with Sfil digested (also made blunt -ended) or 
Fspl digested pS'EPO. The orientations of resulting clones 
are analyzed and those in which the former mMT Bglll site 
is proximal to the Sail site in the plasmid polylinker are 
used for targeting primary and secondary human fibro- 
blasts. This orientation directs mMT transcription to- 
wards HUMERPA nucleotide position 610 in the final con- 
struct. The resulting plasmids are designated p5'EPO-mMTF 
and p5'EPO-mMTS for the mMT insertions in the Fspl and 
Sfil sites, respectively. 

Additional upstream sequences are useful in cases 
where it is desirable to modify, delete and/or replace 
negative regulatory elements or enhancers that lie up- 
stream of the initial target sequence. In the case of 
EPO, a negative regulatory element that inhibits EPO 
expression in extrahepatic and extrarenal tissues [Semen- 
za, G.L. st Sl-f Mol. Cell. Biol, ji o * Q30«- a (1990)] can 
be deleted. A series of deletions within the 6 kb frag- 
ment are prepared. The deleted regions can be replaced 
with an enhancer with broad host -cell activity [e.g. an 
enhancer from the Cytomegalovirus (CMV)]. 

The orientation of the 609 bp 5 'EPO fragment in the 
pBluescriptIISK/+ vector was chosen since the HUMERPA 
sequences are preceded on their 5' end by a BamHI (distal) 
and Hindlll site (proximal). Thus, a 6 kb BamHI-Hindlll 
fragment normally lying upstream of the 609 bp fragment 
[Semenza, G. L. £t al^, Mol, Cell. Biol. 1^:930-938 
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(1990)] can be isolated from genomic DNA by known methods. 
For example, a bacteriophage, cosmid, or yeast artificial 
chromosome library could be screened with the 609 bp PCR 
amplified fragment as a probe. The desired clone will 
5 have a 6 kb BamHI-Hindlll fragment and its identity can be 
confirmed by comparing its restriction map from a restric- 
tion map around the human EPO gene determined by known 
methods. Alternatively, constructing a restriction map of 
the human genome upstream of the EPO gene using the 609 bp 
10 fragment as a probe can identify enzymes which generate a 
fragment originating between HDMERPA coordinates 2 and 609 
and extending past the upstream BamHl site; this fragment 
can be isolated by gel electrophoresis from the appropri- 
ate digest of human genomic DNA and ligated into a bacte- 
15 rial or yeast cloning vector. The correct clone will 
hybridize to the 609 bp 5 'EPO probe and contain a 6 kb 
BamHI-Hindlll fragment. The isolated 6 kb fragment is 
inserted in the proper orientation into p5'EP0, pS'EPO- 
mMTF, or pS'EPO-mMTS (such that the Hindlll site is adja- 
20 cent to HDMERPA nucleotide position 2) . Additional up- 
stream sequences can be isolated by known methods, using 
chromosome walking techniques or by isolation of yeast 
artificial chromosomes hybridizing to the 609 bp 5 'EPO 
probe . 

The cloning strategies described above allow sequenc- 
es upstream of EPO to be modified jjj yifero for subsequent 
targeted transfection of primary, secondary or immortal- 
ized human fibroblasts. The strategies describe simple 
insertions of the mMT promoter, as well as deletion of the 
negative regulatory region, and deletion of the negative 
regulatory region and replacement with an enhancer with 
broad host -cell activity. ' 
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9- ACTIVATING THE ■ HTTMA M EPO GENE AND ISOLATION OF TAR- 
GETED PRIMARY. SECON DARY AND TMM0RTAT,T7En mmw 
FIBROBLASTS BY firpp^WT^ 

For targeting, the plasraids are cut with restriction 
5 enzymes that free the insert away from the plasmid back- 
bone. In the case of p5'EP0-mMTS, Hindlll and Sail diges- 
tion releases a targeting fragment of 2.4 kb, comprised of 
the 1.8 kb mMT promoter flanked on the 5' and 3' sides by 
405 bp and 204 base pairs, respectively, of DNA for tar- 
10 geting this construct to the regulatory region of the 

human EPO gene. This DNA or the 2.4 kb targeting fragment 
alone is purified by phenol extraction and ethanol precip- 
itation and transfected into primary or secondary human 
fibroblasts under the conditions described in Example lc. 
15 Transfected cells are plated onto 150 mm dishes in human 
fibroblast nutrient medium. 48 hours later the cells are 
plated into 24 well dishes at a density of 10,000 
cells/cm 2 [approximately 20,000 cells per well; if target- 
ing occurs at a rate of 1 event per 10* clonable cells 
(Example lc, then about 50 wells would need to be assayed 
to isolate a single expressing colony] . Cells in which the 
transfecting DNA has targeted to the homologous region 
upstream of the human EPO gene will express hEPO under the 
control of the mMT promoter. After 10 days, whole well 
supernatants are assayed for EPO expression using a com- 
mercially available immunoassay kit (Amgen) . Clones from 
wells displaying hEPO synthesis are isolated using known 
methods, typically by assaying fractions of the heteroge- 
nous populations of cells separated into individual wells 
or plates, assaying fractions of these positive wells, and 
repeating as needed, ultimately isolating the targeted 
colony by screening 96 -well microtiter plates seeded at 
one cell per well. DNA from entire plate lysates can also 
be analyzed by PCR for amplification of a fragment using a 
mMT specific primer in conjunction with a primer lying 
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upstream of HUMERpa nucleotide position 1. This primer 
pair should amplify a DNA fragment of a size precisely 
predicted based on the DNA sequence. Positive plates are 
trypsinized and replated at successively lower dilutions 
5 and the DNA preparation and PGR steps repeated as needed' 
to isolate targeted cells. 

The targeting schemes herein described can also be 
used to activate hGH expression in immortalized human 
cells (for example, HT1080 cells (ATCC CCL 121) , HeLa 
) cells and derivatives of HeLa cells (ATCC CCL2, 2.1 and 
2.2), MCP-7 breast cancer cells (ATCC HBT 22), K-562 
leukemia cells (ATCC CCL 232) , KB carcinoma cells (ATCC 
CCL 17) , 2780AD ovarian carcinoma cells (Van der Blick, 
A.M. eial., .Cancer Res, 48:5927-5932 (1988), Raji cells 
(ATCC CCL 86), Jurkat cells (ATCC TIB 152), Namalwa cells 
(ATCC CRL 1432), HL-60 cells (ATCC CCL 240), Daudi cells 
(ATCC CCL 213), RPMI 8226 cells (ATCC CCL 155), U-937 
cells (ATCC CRL 1593) , Bowes Melanoma cells (ATCC CRL 
9607), WI-38VA13 subline 2R4 cells (ATCC CLL 75.1), MOLT-4 
cells (ATCC CRL 1582) , and varous heterohybridoma cells) 
for the purposes of producing hGH for conventional pharma- 
ceutic delivery. 

h V ACnVATTNO THE HUMAN KPO GFNE AWT) t.^t^tt^t n? ^ 
SETED P3TMARY SErnwnapv ^wp tmmopt^ized himb 
FIBROBLASTS BY * POSITIVE np , CQMUfflg POSTTTV*/ 
NEGATTVE fiFT- ECTION ££EIM 

The strategy for constructing pS'EPO-mMTF, pS'EPO- 
niMTS, and derivatives of such with the additional upstream 
6 kb BamHI-Hindlll fragment can be followed with the addi- 
tional step of inserting the neo gene adjacent to the mMT 
promoter. In addition, a 'negative selection marker, for 
example, gpt [from pMSG (Pharmacia) or another suitable 
source] , can be inserted adjacent to the HUMERPA sequences 
in the pBluescriptllSKA polylinker. m the former case, 
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G418 r colonies are isolated and screened by PCR amplifica- 
tion or restriction enzyme and Southern hybridization 
analysis o£ DNA prepared from pools of colonies to identi- 
fy targeted colonies. In the latter case, G418 r colonies 
5 are placed in medium containing 6-thioxanthine to select 
against the integration of the gpt gene [Besnard, C. e£ 
MaL Cel l- Bigj 2:4139-4141 (1987)]. in addition, 
the HSV-TK gene can be placed on the opposite side of the 
insert as gpt, allowing selection for neo and against both 
10 gpt and TK by growing cells in human fibroblast nutrient 
medium containing 400 /xg/ml G418, 100 fiM 6-thioxanthine, 
and 25 fig/xal gancyclovir. The doubie negative selection 
should provide a nearly absolute selection for true tar- 
geted events and Southern blot analysis provides an ulti- 
15 mate confirmation. 

The targeting schemes herein described can also be 
used to activate hEPO expression in immortalized human 
cells (for example, HT1080 cells (ATCC CCL 121) , HeLa 
cells and derivatives of HeLa cells (ATCC CCL2, 2.1 and 
20 2.2), MCF-7 breast cancer cells (ATCC HBT 22), K-562 

leukemia cells (ATCC CCL 232), KB carcinoma cells (ATCC 
CCL 17) , 2780AD ovarian carcinoma cells (Van der Blick, 
A.M. £aasgr_E££,_ia: 5927-5932 (1988), Raji cells 

(ATCC CCL 86) , Jurkat cells (ATCC TIB 152) , Namalwa cells 
25 (ATCC CRL 1432), HL-60 cells (ATCC CCL 240), Daudi cells 
(ATCC CCL 213), RPMI 8226 cells (ATCC CCL 155), U-937 
cells (ATCC CRL 1593) , Bowes Melanoma cells (ATCC CRL 
9607), WI-38VA13 subline 2R4 cells (ATCC CLL 75.1), MOLT-4 
cells (ATCC CRL 1582), and various heterohybridoma cells) 
10 for the purposes of producing hEPO for conventional phar- 
maceutic delivery. 
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zemnmciim of mmiim el &s mibs eqb *Lmm rir 

HUMAN GROWTH HORMONF. ctmt r ^ ER TWP ro^nx ^ ^ 
MOUSE tWrATJOTHTONFTW PROMDTf p tw p ^y. CTmm>tlwr 
OR IMMQRTALIZEF) wnM ^N FTRffnpr .^c r o 
5 The following example serves to illustrate one em- 

bodiment of the present invention, in which the normal 
regulatory sequences upstream of the human growth hormone 
gene are altered to allow expression of human growth 
hormone in primary, secondary or immortalized human fibro- 
10 blasts. 

Targeting molecules similar to those described in 
Example if for targeting to the EPO gene regulatory region 
are generated using cloned DNA fragments derived from the 
5' end of the human growth hormone N gene. An approxi- 
15 mately 1.8 kb fragment spanning HUMGHCSA (Genbank Entry) 
nucleotide positions 3787-5432 (the positions of two EcoNl 
sites which generate a convenient sized fragment for 
cloning or for diagnostic digestion of subclones involving 
thas fragment) is amplified by PGR primers designed by 
20 analysis of the HUMGHCSA sequence in this region. This 
region extends from the middle of hGH gene N intron 1 to 
an upstream position approximately 1.4 kb 5' to the trans- 
it ional start site. P UC12 is digested with EcoRI and 
BamHI, treated with Klenow to generate blunt ends, and 
S recircularized under dilute conditions, resulting in 

plasmids which have lost the EcoRI and BamHI sites. This 
plasmid is designated P UC12XEB. Hindlll linkers are 
legated onto the amplified hGH fragment and the resulting 
fragment is digested with Hindlll and ligated to Hindlll 
digested pUC12XEB. The resulting plasmid, pUC12XEB-5»hGH 
is digested with EcoRI and BamHI, to remove a 0.5 kb 
fragment lying immediately upstream of the hGH transcrip- 
tional initiation site. The digested DNA is ligated to 
the 1.8 kb EcoRI-Bglll from the iriMT-I gene [containing no 
» mMT coding sequences; Hamer, D.H. and Walling, M., j. Mo1 



WO 95/31560 PCT/US95/06045 



-60- 

Appl. Ggp. 1:273-288 (1982); the fragment can also be 
isolated by known methods from mouse genomic DNA using PCR 
primers designed from analysis of mMT sequences available 
from Genbank; i.e., MUSMTI, MUSMTIP, MUSMTIPRM] . This 
5 plasmid p5'hGH-mMT has the mMT promoter flanked on both 
sides by upstream hGH sequences. 

The cloning strategies described above allow sequenc- 
es upstream of hGH to be modified in vitro for subsequent 
targeted transfection of primary, secondary or immortal - 
10 ized human fibroblasts. The strategy described a simple 
insertion of the mMT promoter. Other strategies can be 
envisioned, for example, in which an enhancer with broad 
host-cell specificity is inserted upstream of the inserted 
mMT sequence. 

15 j- ACTIVATING THE HUMAN hGH GENE AND I S OLATION OF TAP- 
GPTED PR?MASY f SECONDARY AND IMMORTA LIZED HUMAN 
FIBROBLASTS BY SOmaWTiys 

For targeting, the plasmids are cut with restriction 
enzymes that free the insert away from the plasmid back- 

20 bone. In the case of p5'hGH-mMT, Hindlll digestion re- 
leases a targeting fragment of 2.9 kb, comprised of the 
1.8 kb mMT promoter flanked on the 5' end 3' sides by DNA 
for targeting this construct to the regulatory region of 
the hGH gene. This DNA or the 2.9 kb targeting fragment 

25 alone is purified by phenol extraction and ethanol precip- 
itation and transfected into primary or secondary human 
fibroblasts under the conditions described in Example 11. 
Transfected cells are plated onto 150 mm dishes in human 
fibroblast nutrient medium. 48 hours later the cells are 

30 plated into 24 well dishes at a density of 10,000 

cells/cm 2 [approximately 20,000 cells per well; if target- 
ing occurs at a rate of 1 event per 10* clonable cells 
(Example lc) , then about 50 wells would need to be assayed 
to isolate a single expressing colony] . Cells in which the 
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transfecting DNA has targeted to the homologous region 
upstream of hGH will express hGH under the control of the 
mMT promoter. After 10 days, whole well supernatants are 
assayed for hGH expression using a commercially available 
5 immunoassay kit (Nichols, . clones from wells displaying 
hGH synthesis are isolated using known methods, typically 
by assaying fractions of the heterogenous populations of 
cells separated into individual wells or plates, assaying 
fractions of these positive wells, and repeating as need- 
10 ed, ultimately isolated the targeted colony by screening 
96-well microtiter plates seeded at one cell per well 
DNA from entire plate lysates can also be analyzed by'pcR 
for, amplification of a fragment using a mMT specific 

„ ^T er " C ° njUnction with a Primer lying downstream of 
15 HDMGHCSA nucleotide position 5,432. This primer pair 

should amplify a DNA fragment of a size precisely predict- 
ed based on the DNA sequence. Positive plates are tryp- 
simzed and replated at successively lower dilutions, and 
the DNA preparation and PCR steps repeated as needed to 
20 isolate targeted cells. 

The targeting schemes herein described can also be 
used to activate hGH expression in immortalized human 
cells (for example, HT1080 cells (ATCC CCL 121), HeLa 

'5 2 6 2) S MOP *f VatiVeS ° f H6La (ATCC CCL2, 2.1 and 

2.2), MCF-7 breast cancer cells (ATCC HBT 22), K-562 
leukemia cells (ATCC CCL 232), KB carcinoma cells (ATCC 
CCL 17) , 2780AD ovarian carcinoma cells (Van der Blick, 
A.M. et al., Cancer Pop 111:5927-5932 (1988), Raji cells 

0 lllTr T ' ^ CellS <ATCC TIB 152) • cells 
0 ATCC CRL 1432, , HL-80 cells (ATCC CCL 240) , Daudi cells 

(ATCC CCL 213,, RPMI 8226 cells (ATCC CCL 155), D-937 

cells (ATCC CRL 1593), Bowes Melanoma cells (ATCC CRL 

5607), WI-38VA13 subline 2R4 cells (ATCC CLL 75.1) , MOLT-4 

cells (ATCC CRL 1582) , and various heterohybridoma cells) 
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for the purposes of 'producing hGH for conventional pharma- 
ceutic delivery. 

k. ACTIVATING THE HUM AN hGH GENE AND ISOLATION OF TAR- 
GETED PRIMARY , SECONDARY AND IMMORTALI ZED HUMAN 
FIBROBLASTS BY A POSITIVE OR A COMBINED pmsyrrmf 
NEGATIVE SELECTION fiVflTRM 

The strategy for constructing p5 1 hGH-mMT can be 
followed with the additional step of inserting the neo 
gene adjacent to the mMT promoter. In addition, a nega- 
tive selection marker, for example, gpt [from pMSG (Phar- 
macia) or another suitable source] , can be inserted adja- 
cent to the HUMGHCSA sequences in the pUC12 poly-linker. 
In the former case, G418 r colonies are isolated and 
screened by PCR amplification or restriction enzyme and 
Southern hybridization analysis of DNA prepared from pools 
of colonies to identify targeted colonies. In the latter 
case, G418 r colonies are placed in medium containing 
thioxanthine to select against the integration of the gpt 
gene (Besnard, C. £t al^, Mol. Cell. Binl 2 : 4139-4141 
(1987)] . m addition, the HSV-TK gene can be placed on 
the opposite side of the insert as gpt, allowing selection 
for neo and against both gpt and TK by growing cells in 
human fibroblast nutrient medium containing 400 /xg/ml 
G418, 100 fM 6 -thioxanthine, and 25 /xg/ml gancyclovir. 
The double negative selection should provide a nearly 
absolute selection for true targeted events. Southern 
hybridization analysis is confirmatory. 

The targeting schemes herein described can also be 
used to activate hGH expression in immortalized human 
cells (for example, HT1080 cells (ATCC CCL 121), HeLa 
cells and derivatives of HeLa cells (ATCC CCL2, 2.1 and 
2.2), MCF-7 breast cancer cells (ATCC HBT 22), K-562 
leukemia cells (ATCC CCL 232) , KB carcinoma cells (ATCC 
CCL 17) , 2780AD ovarian carcinoma cells (Van der Blick, 
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A.M. fit H., C ancer Res, 48 :5927-5932 (1988), Raji cells 
(ATCC CCL 86), Jurkat cells (ATCC TIB 152), Namalwa cells 
(ATCC CRL 1432), HL-60 cells (ATCC CCL 240) , Daudi cells 
(ATCC CCL 213), R PMI 8 226 cells (ATCC CCL 155), U-937 
cells (ATCC CRL 1593), Bowes Melanoma cells (ATCC CRL 
9607), WI-38VA13 subline 2R4 cells (ATCC CLL 75.1), MOLT-4 
cells (ATCC CRL 1582) , and various heterohybridoma cells) 
for the purposes of producing hGH for conventional pharma- 
ceutic delivery. 

The targeting constructs described in Examples if and 
li, and used in Examples lg, ih , ij an d lk can be modified 

to mclude an amplifiable selectable marker (e.g., ada 
dhfr, or CAD, „ hich is useful for selecting ^ ±r ^ 

the activated endogenous gene, and the amplifiable select - 
15 able marker, are amplified. Such cells, expressing or 
capable of expressing the endogenous gene encoding a 
therapeutic product can be used to produce proteins (e.g., 
hGH and hEPO) for conventional pharmaceutic delivery or 
for gene therapy. 

20 1 ' TRANSFRCTTOW OF PRTM»v m ^ OWDapv vr ^ r ^ 

WITH EXOORNOTTS IM ANT) fi BW.FTTWBLB Mnprap ^ ry 
ELECTROPopa T T ?f 

Exponentially growing or early stationary phase 
fibroblasts are trypsinized and rinsed from the plastic 
25 surface with nutrient medium. An aliquot of the cell 
suspension is removed for counting, and the remaining 
cells are subjected to centrifugation. The supernatant is 
aspirated and the pellet is resuspended in 5 ml of elec- 
?n ^I° P ° rati0n buffer <20 mM HEPES pH 7.3, 137 mM NaCl, 5 mM 
30 KC1, 0.7 mM Na 2 HP0„ 6 mM dextrose). The cells are recen- 
tnfuged, the supernatant 'aspirated, and the cells resus- 
pended in electroporation buffer containing 1 mg/ml acety- 
lated bovine serum albumin. The final cell suspension 
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contains approximately 3 x 10* cells/ml. Electroporation 
should be performed immediately following resuspension. 

Supercoiled plasmid DNA is added to a sterile cuvette 
with a 0.4 cm electrode gap (Bio-Rad.) The final DNA 
5 concentration is generally at least 120 ftg/xal. 0.5 ml of 
the cell suspension (containing approximately 1.5 x 10* 
cells) is then added to the cuvette, and the cell suspen- 
sion and DNA solutions are gently mixed. Electroporation 
is performed with a Gene-Pulser apparatus (Bio-Rad) . 
10 Capacitance and voltage are set at 960 fiF and 250-300 V, 
respectively. As voltage increases, cell survival de- 
creases, but the percentage of surviving cells that stably 
incorporate the introduced DNA into their genome increases 
dramatically. Given these parameters, a pulse time of 
15 approximately 14-20 msec should be observed. 

Electroporated cells are maintained at room tempera- 
ture for approximately 5 min, and the contents of the 
cuvette are then gently removed with a sterile transfer 
pipette. The cells are added directly to 10 ml of pre- 
warmed nutrient media (as above with 15% calf Berum) in a 
10 cm dish and incubated as described above. The follow- 
ing day, the media is aspirated and replaced with 10 ml of 
fresh media and incubated for a further 16-24 hours. 
Subculture of cells to determine cloning efficiency and to 
select for G418 -resistant colonies is performed the fol- 
lowing day. Cells are trypsinized, counted and plated; 
typically, fibroblasts are plated at 10 J cells/10 cm dish 
for the determination of cloning efficiency and at 1-2 x 
10* cells/10 cm dish for G418 selection. 

Human fibroblasts are selected for G418 resistance in 
medium consisting of 300-400 ftg/ml G418 (Geneticin, disul- 
fate salt with a potency of approximately 50%; Gibco) in 
fibroblasts nutrient media (with 15% calf serum) . Cloning 
efficiency is determined in the absence of G418. The 
plated cells are incubated for 12-14 days, at which time 
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colonies are fixed with formalin, stained with crystal 
violet and counted (for cloning efficiency plated) or 
isolated using cloning cylinders (for G418 plates) 
Electroporation and selection of rabbit fibroblasts is 
5 performed essentially as described for human fibroblasts 
with the exception of the selection conditions used 
Rabbit fibroblasts are selected for G418 resistance in 
medium containing l gm/ml G418. 

Fibroblasts were isolated from freshly excised human 
10 foreskins. Cultures were seeded at 50,000 cells/cm in 
DMEM + 10% calf serum. When cultures became confluent 
fibroblasts were harvested by trypsinization and trans- 
fected by electroporation. Electroporation conditions 
were evaluated by transfection with the plasmid pcDNEO 
(Figure 5) . a representative electroporation experiment 
using near optimal conditions (60 m of plasmid pcDNEO at 
an electroporation voltage of 250 volts and a capacitance 
setting of 960 farads) resulted in one G418 colony per 
588 treated cells (0.17% of all cells treated), or one 
0 G418 colony per 71 clonable cells (1.4%). 

When nine separate electroporation experiments at 
near optimal conditions (60 „g of plasmid pcDNEO at an 
electroporation voltage of 300 volts and a capacitance 
setting of 960 /.Farads) were performed, an average of one 
0418 colony per 1,899 treated cells (0.05%) was observed, 
with a range of 1/882 to 1/7,500 treated cells. This 
corresponds to an average of one G4ie colony per 38 clon- 
able cells (2.6%) . 

Low passage primary human fibroblasts were converted 
to hGH expressing cells by co- transfection with plasmids; 
PCDNEO and pXGH5. Typically, 60 „g of an equimolar mix- 
ture of the two plasmids were transfected at near optimal 
conditions (electroporation voltage of 300 volts and a • 
capacitance setting of 960 M Farads) . The results of such 
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an experiment resulted in one G418 colony per 14,705 
treated cells. 

hGH expression data for these and other cells isolat- 
ed under identical transfection conditions are summarized 
below. Ultimately, 98% of all G418 r colonies could be 
expanded to generate mass cultures. 
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Number of G418 r Clones 

Analyzed 
Number of G418 r /hGH 

Expressing Clones 
Average hGH Expression 

Level 

Maximum hGH Expression 
Level 
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2.3 fig hGH/10 s Cells/24 hr 



23.0 fig hGH/10 6 Cells/24 hr 



Stable transfectants also have been generated by 
electroporation of primary or secondary human fibroblasts 
with pXGH301, a DNA construct in which the neo and hGH 
genes are present on the same plasmid molecule. pXGH301 
was constructed by a two-step procedure. The Sall-Clal 
fragment from pBR322 (positions 23-651 in pBR322) was 
isolated and inserted into Sall-Clal digested pcDNEO, 
introducing a BamHl site upstream of the SV40 early pro- 
moter region of pcDNEO. This plasmid, pBNEO was digested 
with BamHi and the 2.1 kb fragment containing the neo gene 
under the control of the SV40 early promoter, was isolated 
and inserted into BamHI digested pXGH5 . A plasmid with a 
single insertion of the 2.1 kb BamHI fragment was isolated 
in which neo and hGH are transcribed in the same direction 
relative to each other. This plasmid was designated 
PXGH301. For example, 1.5' x 10« cells were electroporated 
with 60 ng pXGH301 at 300 volts and 960 /zFarads. G418 
resistant colonies were isolated from transfected second- 
ary fibroblasts at a frequency of 652 G418 resistant 
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colonies per l.s xio treated cells (l per 2299 treated 
cells). Approximately 59% of these colonies express hGH. 

EXAMPLE 2. .CONSTRUCTION OF TAP^pttmp. piasmthq wuxmx f p _ 
SULT IN CTfTMKffTC TRANSCRTPTT ON mTTR T M HiTiro 
HUMAN GT?nwT*f ^? M ONB AWn PP VTHROPnTBT T M pp. 
OUENCEfi ARE PTTflBn 

The following serves to illustrate two further em- 
bodiments of the present invention, in which the normal 
regulatory sequences upstream. of the human EPO gene are 
altered to allow expression of hEPO in primary or second- 
ary fibroblast strains which do not express hEPO in de- 
tectable quantities in their untransfected state as ob- 
tained, m these embodiments, the products of the target- 
ing events are chimeric transcription units in which the 
first exon of the human growth hormone gene is positioned 
upstream of hEPO exons 2-5. The product of transcription, 
splicxng and translation is a protein in which amino acids 
1-4 of the hEPO signal peptide are replaced with amino 
acid residues 1-3 of hGH. The two embodiments differ with 
respect to both the relative positions of the foreign 
regulatory sequences that are inserted and the specific 
pattern of splicing that needs to occur to produce the 
final, processed transcript. 

Plasmid pXEPO-10 is designed to replace exon l of 
hEPO with exon 1 of hGH by gene targeting to the endoge- 
nous hEPO gene on human chromosome 7. Plasmid pXEPO-10 is 
constructed as follows. First, the intermediate plasmid 
PT163 is constructed by inserting the 6 kb Hindlll-BamHI 
fragment (see Example if) lyi„ g upstream of the hEPO 
coding region into Hindlll-BamHI digested pBluescriptll 
SK + (Stratagene, LaJolla, ' CA) . The product of this liga- 
tion is digested with Xhol and HindHI and ligated to the 
1.1 kb Hindlll-Xhol fragment f rom pMClneoPolyA [Thomas, K. 
R. and Capecchi, M. R. £sXl 11: 503-512 (1987) available 
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from Strategene, LaJolla, CA] to create pT163 . Oligo- 
nucleotides 13.1 - 13.4 are utilized in polymerase chain 
reactions to generate a fusion fragment in which the mouse 
metallothionein 1 (raMT-I) promoter - hGH exon 1 sequences 
5 are additionally fused to hEPO intron 1 sequences. First, 
oligonucleotides 13.1 and 13.3 are used to amplify the 
approximately 0.73 kb raMT-I promoter - hGH exon 1 fragment 
from pXGH5 (Figure 5). Next, oligonucleotides 13.2 and 
13.4 are used to amplify the approximately 0.57 kb frag- 
10 ment comprised predominantly of hEPO intron 1 from human 
genomic DNA. Finally, the two amplified fragments are 
mixed and further amplified with oligonucleotides 13 . 1 and 
13.4 to generate the final fusion fragment (fusion frag- 
ment 3) flanked by a Sail site at the 5' side of the mMT-I 
15 moiety and an Xhol site at the 3' side of the hEPO intron 
1 sequence. Fusion fragment 3 is digested with Xhol and 
Sail and ligated to Xhol digested pT163. The ligation 
mixture is transformed into E. coli and a clone containing 
a single insert of fusion fragment 3 in which the Xhol 
20 site is regenerated at the 3' side of hEPO intron 1 se- 
quences is identified and designated pXEPO-10. 

13.1 5' TTT TGTCGAC GGTACCT Tfy; TTTTTAAAAC C 
Sail Kpnl 
(SEQ ID NO 5) 

25 13.2 5' CCTAGCGGCA ATGGCTACAG GTGAGTACTC GCGGGCTGGG CG 
(SEQ ID NO 6) 

13.3 5' CGCCCAGCCC GCGAGTACTC ACCTGTAGCC ATTGCCGCTA GG 

(SEQ ID NO 7) 

13.4 5' TTTTCTCGAG CTAGAACAGA TAGCCAGGCT G 
30 Xhol 

(SEQ ID NO 8) 



The non-boldface region of oligo 13.1 is identi- 
cal to the mMT-I promoter, with the natural Kpnl 
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site as its 5' boundary. The boldface type 
denotes a Sail site tail to convert the 5' boun- 
dary to a Sail site. The boldface region of 
oligos 13.2 and 13.3 denote hGH sequences, while 
the non-boldface regions are intron 1 sequences 
from the hEPO gene. The non-boldface region of 
oligo 13.4 is identical to the last 25 bases of 
hEPO intron 1. The boldface region includes an 
Xhol site tail to convert the 3' boundary of the 
amplified fragment to an Xhol site. 



Plasmid pXEPO-ll is designed to place, by gene tar- 
geting, the mMT-l promoter and exon 1 of hGH upstream of 
the hEPO structural gene and promoter region at the endog- 
enous hEPO locus on human chromosome 7. Plasmid pXEPO-ll 
15 is constructed as follows. Oligonucleotides 13.1 and 13.5 
-13.7 are utilized in polymerase chain reactions to 
generate a fusion fragment in which the mouse metallo- 
thionein 1 (mMT-I) promoter - hGH exon 1 sequences are 
additionally fused to hEPO sequences from -1 to -630 
20 relative to the hEPO coding region. First, oligonucleo- 
tides 13.1 and 13.6 are used to amplify the approximately 
0.75 kb mMT-I promoter - hGH exon 1 fragment from pXGHS 
(Figure 5) . Next, oligonucleotides 13.5 and 13.7 are used 
to amplify, from human genomic DNA, the approximately 
0.65 kb fragment comprised predominantly of hEPO sequences 
from -1 to -620 relative to the hEPO coding region. Both 
oligos 13.5 and 13.6 contain a 10 bp linker sequence 
located at the hGH intron 1 - hEPO promoter region, which 
corresponds to the natural hEPO intron 1 splice-donor 
30 site. Finally, the two amplified fragments are mixed and 
further amplified with oligonucleotides 13.1 and 13.7 to 
generate the final fusion fragment (fusion fragment 6) . 
flanked by a Sail site at the 5' side of the mMT-I moiety 
and an Xhol site at the 3' side of the hEPO promoter 
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region. Fusion fragment 6 is digested with Xhol and Sail 
and ligated to Xhol digested pT163 . The ligation mixture 
is transformed into E. coli and a clone containing a 
single insert of fusion fragment 6 in which the Xhol site 
is regenerated at the 3' side of hEPO promoter sequences 
is identified and designated pXEPO-11. 

13.5 5' GACAGCTCAC CTAGCGGCAA TGGCTACAGG TGAGTACTC 

AAGJCJTCTGG GCTTCCAGAC CCAG (SEQ ID NO 9) 
Hindlll 

13.6 5' CTGGGTCTGG AAGCCCAGAA GCTT GAflrar rCACCTGTAG 

Hindi I I 

CCATTGCCGC TAGGTGAGCT GTC (SEQ ID NO 10) 

13.7 5' TTTTOTCGAG CTCCGCGCCT GGCCGGGGTC CCTC 

Xhol 
(SEQ ID NO 11) 



The boldface regions of oligos 13.5 and 13.6 
denote hGH sequences. The italicized regions 
correspond to the first 10 base pairs of hEPO 
intron 1. The remainder of the oligos corre- 
spond to hEPO sequences from -620 to -597 rela- 
tive to the hEPO coding region. The non-bold- 
face region of oligo 13.7 is identical to bas- 
es -l to -24 relative to the hEPO coding region. 
The boldface region includes an Xhol site tail 
to convert the 3' boundary of the amplified 
fragment to an Xhol site. 

Plasmid pXEPO-10 can be used for gene targeting by 
digestion with BamHI and Xhol to release the 7.3 kb frag- 
ment containing the mMT-I/hGH fusion flanked on both sides 
by hEPO sequences. This fragment (targeting fragment 1) 
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contains no hEPO coding sequences, having only sequences 
lying between -620 and approximately -SS20 upstream of the 

COding re * ion and ^PO intron 1 sequences to direct 
targeting to the human EPO locus. Targeting fragment 1 is 
5 transfected into primary or secondary human skin fibro- 
blasts using conditions similar to those described in 
Example lc. G418 -resistant colonies are picked into 
individual wells of 96-well plates and screened for EPO 
1n ^P ression ^ ^ ELISA assay ( RfiD Systems, Minneapolis 
10 MN) . cells in which the transfecting DNA integrates 

randomly into the human genome cannot produce EPO Cells 
in which the transfecting DNA has undergone homologous 
recombination with the endogenous hEPO intron 1 and hEPO 
upstream sequences contain a chimeric gene in which the 
mMT-1 promoter and non- transcribed sequences and the hGH 
5' untranslated sequences and hGH exon 1 replace the 
normal hEPO promoter and hEPO exon 1 (see Figure 1) . N on- 
hEPO sequences in targeting fragment 1 are joined to hEPO 
sequences down-stream of hEPO intron l. The replacement 
of the normal hEPO regulatory region with the mMT-l pro- 
moter will activate the EPO gene in fibroblasts, which do 

l>H n ,T lly 6XPreSS hBP0, ^ re P lacem «* of hEPO exon 1 
with hGH exon 1 results in a protein in which the first 4 
amino acids of the hEPO signal peptide are replaced with 
am ln o acids 1-3 of hGH, creating a functional, chimeric 
signal peptide which is removed by post -translation pro- 
cessing from the mature protein and is secreted from the 
expressing cells. 

Plasmid pXEPO-li can be used for gene targeting by 
digestion with BamHI and Xhol to release the 7.4 kb frag- 
ment containing the mMT-l/hGH fusion flanked on both sides 
by hEPO sequences. This 'fragment (targeting fragment 2) 
contains no hEPO coding sequences, having only sequences 
lying between -l and approximately - 6620 upstream of the 
hEPO coding region to direct targeting to the human EPO 
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locus. Targeting fragment 2 is transfected into primary 
or secondary human skin fibroblasts using conditions 
similar to those described in Example lg. G418 -resistant 
colonies are picked into individual wells of 96 -well 
5 plates and screened for EPO expression by an ELISA assay 
(R&D Systems, Minneapolis, MN) . Cells in which the trans- 
fecting DNA integrates randomly into the human genome 
cannot produce EPO. Cells in which the transfecting DNA 
has undergone homologous recombination with the endogenous 
10 hEPO promoter and upstream sequences contain a chimeric 
gene in which the mMT-I promoter and non- transcribed 
sequences, hGH 5' untranslated sequences and hGh exon 1, 
and a 10 base pair linker comprised of the first 10 bases 
of hEPO intron 1 are inserted at the Hindlll site lying at 
15 position -620 relative to the hEPO coding region (see 
Figure 2) . The localization of the mMT-I promoter up- 
stream of the normally silent hEPO promoter will direct 
the synthesis, in primary or secondary skin fibroblasts, 
of a message reading (5' to 3') non- translated metallo- 
20 thionein and hGH sequences, hGH exon 1, 10 bases of DNA 

identical to the first 10 base pairs of hEPO intron 1, and 
the normal hEPO promoter and hEPO exon 1 (-620 to +13 
relative to the hEPO coding sequence) . The 10 base pair 
linker sequence from hEPO intron 1 acts as a splice-donor 
25 site to fuse hGH exon 1 to the next downstream splice 
acceptor site, that lying immediately upstream of hEPO 
exon 2. Processing of the resulting transcript will 
therefore splice out the hEPO promoter, exon 1, and intron 
1 sequences. The replacement of hEPO exon 1 with hGH exon 
1 results in a protein in which the first 4 amino acids of 
the hEPO signal peptide are replaced with amino acids 1-3 
of hGH, creating a functional, chimeric signal peptide 
which is removed by post -translation processing from the. 
mature protein and is secreted from the expressing cells. 
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A series of constructs related to pXEPO-10 and pXEPO- 
11 can be constructed, using known methods, m these 
constructs, the relative positions of the mMT-I promoter 
and hGH sequences, as well as the position at which the 
> mMT-I/hGH sequences are inserted into hEPO upstream se- 
quences, are varied to create alternative chimeric tran- 
section units that facilitate gene targeting, result in 
more efficient expression of the fusion transcripts, or 
have other desirable properties. Such constructs will 
give similar results, such that an hGH-hEPO fusion gene is 
Placed under the control of an exogenous promoter by gene 
targeting to the normal hEPO locus. For example, the 6 Jcb 
HmdIII-BamHI fragment upstream of the hEPO gene (See 
Example if) has numerous restriction enzyme recognition 
sequences that can be utilized as sites for insertion of 
the neo gene and the mMT-I promoter/hGH fusion fragment. 
One such site, a Bglli site lying approximately 1.3 kb 
upstream of the Hindlli site, is unique in this region and 
can be used for insertion of one or more selectable mark- 
ers and a regulatory region derived from another gene that 
will serve to activate hEPO expression in primary, second- 
ary, or immortalized human cells. 

First, the intermediate plasmid pT164 is constructed 
by inserting the 6 kb Hindlli-BamHI fragment (Example if, 
lying upstream of the hEPO coding region into Hindlll- 
BamHi digested pBluescriptll SK+ (Stratagene, LaJolla 
CA). Plasmid pMCineoPolyA [Thomas, K.R. and Capecchi' 
M.R Csli ^503-512 (i 987 > , available from Stratagene, 
LaJolla, CA] is digested with BamHI and Xhol, made blunt- 
ended by treatment with the Klenow fragment of E. coli DNA 
polymerase, and the resulting l.i kb fragment is purified 
PT1S4 is digested with Bgl'll and made blunt-ended by 
treatment with the Klenow fragment of E. coli DNA polymer- 
ase. The two preceding blunt-ended fragments are ligated 
together and transformed into competent E. coli. clones 



WO 95/31560 



PCT/US95/06045 



-74- 

with a single insert of the l.l kb neo .fragment are iso- 
lated and analyzed by restriction enzyme analysis to 
identify those in which the Bglll site recreated by the 
fusion of the blunt Xhol and Bglll sites is localized 
1.3 kb away from the unique Hindlll site present in plas- 
mid pT164. The resulting plasmid, pT165, can now be 
cleaved at the unique Bglll site flanking the 5' side of 
the neo transcription unit. 

Oligonucleotides 13.8 and 13.9 are utilized in poly- 
merase chain reactions to generate a fragment in which the 
mouse metallothionein I (mMT-I) promoter - hGH exon 1 
sequences are additionally fused to a 10 base pair frag- 
ment comprising a splice-donor site. The splice-donor 
site chosen corresponds to the natural hEPO intron 1 
splice-donor site, although a larger number of splice- 
donor sites or consensus splice-donor sites can be used. 
The oligonucleotides (13.8 and 13.9) are used to amplify 
the approximately 0.73 kb mMT-I promoter - hGH exon 1 
fragment from pXGH5 (Figure 5) . The amplified fragment 
(fragment 7) is digested with Bglll and ligated to Bglll 
digested pT165. The ligation mixture is transformed into 
E. coli and a clone, containing a single insert of frag- 
ment 7 in which the Kpnl site in the mMT- I . promoter is 
adjacent to the 5' end of the neo gene and the mMT-I 
promoter is oriented such that transcription is directed 
towards the unique Hindlll site, is identified and desig- 
nated pXEPO-12. 

13 • 8 5' AAA AAGATCT GGTACCT TGG TTTTTAAAAC CAGCCTGGAG 
Bglll Kpnl 
(SEQ ID NO 12) 

The non-boldface region of oligo 13.8 is identi- 
cal to the mMT-I promoter, with the natural Kpnl 
site as its 5' boundary. The boldface type 
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denotes a Bglli site tail to convert the 5' 
boundary to a Bglli site. 

13.9 5' TTTTASaiSI GAGTACTCAC CTGTAGCCAT TGCCGCTAGG 
- Bglli 

5 (SEQ ID NO 13} 

The boldface region of oligos 13.9 denote hGH 
sequences. The italicized region corresponds to 
the first 10 base pairs of hEPO intron l. The 
underlined Bglli site is added for plasmid con- 
10 struction purposes. 

Plasmid pXEPO-12 can be used for gene targeting by 
digestion with BamHI and Hindlli to release the 7.9 kb 
fragment containing the neo gene and the mMT-I/hGH fusion 
flanked on both sided by hEPO sequences. This fragment 
(targeting fragment 3) contains no hEPO coding sequences 
having only sequences lying between approximately - 620 and 
approximately - 6620 upstream of the hEPO coding region to 
direct targeting upstream of the human EPO locus. Target- 
ing fragment 3 is transfected into primary, secondary, or 
immortalized human skin fibroblasts using conditions 
similar to those described in Examples lb and ic. G418- 
resistant colonies are picked into individual wells of 96- 
well plates and screened for EPO expression by an ELISA 
assay (r&d Systems, Minneapolis MN) . Cells in which the 
transfecting DNA integrates randomly into the human genome 
cannot produce hEPO. Cells in which the transfecting DNA 
has undergone homologous recombination with the endogenous 
hEPO promoter and upstream sequences contain a chimeric 
gene in which the mMT-I promoter and non-transcribed 
30 sequences, hGH 5' untranslated sequences, and hGH exon 1, 
and a 10 base pair linker comprised of the first 10 bases 
of hEPO intron l are inserted at the Bglli site lying at 
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position approximately -1920 relative to the hEPO coding 
region. The localization of the mMT-I promoter upstream 
of the normally silent hEPO promoter will direct the 
synthesis, in primary, secondary, or immortalized human 
5 fibroblasts (or other human cells) , of a message reading: 
(5' to 3') nontranslated tnetallothionein and hGH sequenc- 
es, hGH exon i, 10 bases of DNA identical to the first 10 
base pairs of hEPO intron l, and hEPO upstream region and 
hEPO exon 1 (from approximately -1920 to +13 relative to 
10 the EPO coding sequence) . The 10 base pair linker se- 
quence from hEPO intron 1 acts as a splice-donor site to 
fuse hGH exon 1 to a downstream splice acceptor site, that 
lying immediately upstream of hEPO exon 2. Processing of 
the resulting transcript will therefore splice out the 
hEPO upstream sequences, promoter region, exon 1, and 
intron 1 sequences. When using pXEPO-10, -11 and -12, 
post -transcriptional processing of the message can be 
improved by using in vjtrp mutagenesis to eliminate splice 
acceptor sites lying in hEPO upstream sequences between 
the mMT-I promoter and hEPO exon l, which reduce level of 
productive splicing events needed create the desired 
message. The replacement of hEPO exon 1 with hGH exon 1 
results in a protein in which the first 4 amino acids of 
the hEPO signal peptide are replaced with amino acids 1-3 
25 of hGH, creating a functional, chimeric signal peptide 
which is removed by post -translation processing from the 
mature protein and is secreted from the expressing cells. 

EXAMPLE 3, TARGETED MODIFICATIO N OF SEOTTRWCES UPSTREAM 
AND AMPLIFICATION OP THE TAPOBTF.n figflE; 
30 Human cells in which the hEPO gene has been activated 

by the methods previously 'described can be induced to 
amplify the neo/mMT-i/EPO transcription unit if the tar- 
geting plasmid contains a marker gene that can confer 
resistance to a high level of a cytotoxic agent by the 
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phenomenon of gene amplification. Selectable marker genes 
such as dihydrofolate reductase {dhfr, selective agent is 
methotrexate) , the multifunctional CAD gene [encoding 
carbamyl phosphate synthase, aspartate transcarbamylase, 
> and dihydro-orqtase; selective agent is N- (phosphono- 

acetyl)-L-aspartate (PALA)], glutamine synthetase; selec- 
tive agent is methionine sulphoximine (MSX) , and adenosine 
deaminase (ada; selective agent is an adenine nucleoside) , 
have been documented, among other genes, to be amplifiable 
in immortalized human cell lines (Wright, J. A. fi£ ai. 
Proc, Natl, Acad. Sci , TTS^ £7;1791-179S (1990); Cockett, 
M.I. fit al- Bio/Technology 8; 662-667 (1990)). m these' 
studies, gene amplification has been documented to occur 
in a number of immortalized human cell lines. HT1080, 
HeLa, MCF-7 breast cancer cells, K-562 leukemia cells,' KB 
carcinoma cells, or 2780AD ovarian carcinoma cells, among 
other cells, display amplification under appropriate 
selection conditions. 

Plasmids pXEPO-10 and pXEPO-ll can be modified by the 
insertion of a normal or mutant dhfr gene into the unique 
Hxndlli sites of these plasmids. After transfection of 
HT1080 cells with the appropriate DMA, selection for G418- 
resxstance (conferred by the neo gene) , and identification 
of cells in which the hEPO gene has been activated by gene 
targeting of the neo, dhfr, and mMT-1 sequences to the 
correct position upstream of the hEPO gene, these cells 
can be exposed to stepwise selection in methotrexate (MIX) 
xn order to select for amplification of dhfr and co-ampli- 
fication of the linked neo, mMT-l, and hEPO sequences 
(Kaufman, R.j. Technique, 2;221-236 (1990)). A stepwise 
selection scheme in which cells are first exposed to low 
levels of MTX (0.01 to 0.08 fM) , followed by successive 
exposure to incremental increases in MTX concentrations up 
to 250 fiM MTX or higher is employed. Linear incremental 
steps of 0.04 to 0.08 (M MTX and successive 2-fold in- 
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creases in MTX concentration will be effective in select- 
ing for amplified transfected cell lines, although a 
variety of relatively shallow increments will also be 
effective. Amplification is monitored by increases in 
5 dhfr gene copy * number and confirmed by measuring in vitro 
hEPO expression. By this strategy, substantial over- 
expression of hEPO can be attained by targeted modifica- 
tion of sequences lying completely outside of the hEPO 
coding region. 

Constructs similar to those described (Examples If , 
lh, li, lk, 2 and 7) to activate hGH expression in human 
cells can also be further modified to include the dhfr 
gene for the purpose of obtaining cells that overexpress 
the hGH gene by gene targeting to non-coding sequences and 
subsequent amplification. 

EXAMPLE * TARGETING AND ACTIVATIO N OF THE HUMAN RPn 

JjOCUS IN AN IMMORTALIZED HUMAN FIBROBLAST LTWB 
The targeting construct pXEPO-13 was made to test the 
hypothesis that the endogenous hEPO gene could be activat- 
ed in a human fibroblast cell. First, plasmid pT22.1 was 
constructed, containing 63 bp of genomic hEPO sequence 
upstream of the first codon of the hEPO gene fused to the 
mouse metallothionein-1 promoter (mMT-I) . Oligonucleo- 
tides 22.1 to 22.4 were used in PCR to fuse mMT-I and hEPO 
sequences. The properties of these primers are as fol- 
lows: 22.1 is a 21 base oligonucleotide homologous to a 
segment of the mMT-I promoter beginning 28 bp upstream of 
the mMT-I Kpnl site; 22.2 and 22.3 are 58 nucleotide 
complementary primers which define the fusion of hEPO and 
mMT-I sequences such that the fusion contains 28 bp of 
hEPO sequence beginning 35 bases upstream of the first 
codon of the hEPO gene, and mMT-I sequences beginning at 
base 29 of oligonucleotide 22.2, comprising the natural 
Bglll site of mMT-I and extending 30 bases into mMT-I 
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sequence; 22.4 is 21 nucleotides in length and is homolo- 
gous to hEPO sequences beginning 725 bp downstream of the 
first codon of the hEPO gene. These primers were used to 
amplify a 1.4 kb DNA fragment comprising a fusion of mMT-I 
5 and hEPO sequences as described above. The resulting 
fragment was digested with Kpnl (the PGR fragment con- 
tained two Kpnl sites.- a single natural Kpnl site in the 
mMT-I promoter region and a single natural Kpnl site in 
the hEPO sequence) , and purified. The plasmid pXEPOl was 
10. also digested with Kpnl, releasing a 1.4 kb fragment and a 
6.4 kb fragment. The 6.4 kb fragment was purified and 
ligated to the 1.4 kb Kpnl PGR fusion fragment. The 
resulting construct was called pT22.l. a second interme- 
diate, pT22.2, was constructed by ligating the approxi- 
15 mately 6 kb Hindlll-BamHl fragment lying upstream of the 
hEPO structural gene (see Example if) to BamHl and Hindlll 
digested pBSIISK + (Stratagene, LaJolla, CA) . A third 
intermediate, pT22.3, was constructed by first excising a 
1.1 kb XhoI/BamHI fragment from pMCINEOpolyA (Stratagene,, 
20 LaJolla, CA) containing the neomycin phosphotransferase 
gene. The fragment was then made blunt-ended with the 
Klenow fragment of DNA polymerase I (New England Biolabs) 
This fragment was then ligated to the Hindi site of 
PBSIISK+ (similarly made blunt with DNA polymerase I) to 
25 produce pT22.3. A fourth intermediate, pT22.4, was made 
by purifying a l.i kb Xhol/Hindlll fragment comprising the 
neo gene from pT22.3 and ligating this fragment to Xhol 
and Hindlll digested pT22.2. pT22.4 thus contains the neo 
gene adjacent to the Hindlll side of the BamHI-Hindlll 
30 upstream hEPO fragment. Finally, pXEPO-13 was generated 
by first excising a 2.0 kb EcoRI/AccI fragment from pT22.- 
1. The EcoRI site of this 'fragment defines the 5' bound- 
ary of the mMT-I promoter, while the AccI site of this 
fragment lies within hEPO exon 5. Thus, the AccI/EcoRl 
fragment contains a nearly complete hEPO expression unit 
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missing only a parr of exon 5 and the natural polyadenyla- 
tion site. This 2.0 kb EcoRI/AccI fragment was purified, 
made blunt-ended by treatment with the Klenow fragment of 
DNA polymerase I, and ligated to Xhol digested, blunt- 
ended, pT22.4. 

HT1080 cells were transfected with Pvul-BaraHI digest- 
ed pXEPO-13. pXEPO-13 digested in this way generates 
three fragments; a 1 kb vector fragment including a por- 
tion of the amp gene, a 1.7 kb fragment of remaining 
vector sequences and an approximately 9 kb fragment con- 
taining hEPO, neo and mMT-I sequences. This approximately 
9 kb BamHI/PvuI fragment contained the following sequences 
in order from the BamHI site: an approximately 5.2 kb of 
upstream hEPO genomic sequence, the l.l kb neo transcrip- 
15 tion unit, the 0.7 kb mMT-I promoter and the 2.0 kb frag- 
ment containing hEPO coding sequence truncated within exon 
5. 45/tg of pEXPO-13 digested in this way was used in an 
electroporation of 12 million cells (electroporation 
conditions were described in Example lb) . This electro- 
poration was repeated a total of eight times, resulting in 
electroporation of a total of 96 million cells. Cells 
were mixed with media to provide a cell density of 1 
million cells per ml and l ml aliquots were dispensed into 
3 t0tal ° f 96 ' 150xm tissue culture plates (Falcon) each 
25 containing a minimum of 35 ml of DMEM/15% calf serum. The 
following day, the media was aspirated and replaced with 
fresh medium containing 0.8 mg/ml G418 (Gibco) . After 10 
days of incubation, the media of each plate was sampled 
for hEPO by ELISA analysis (R&D Systems) . Six of the 96 
30 plates contained at least 10 mU/ml hEPO. One of these 
plates, number 18, was selected for purification of hEPO 
expressing colonies. Each of the 96, 150 mm plates con- 
tained approximately 600 G418 resistant colonies (an 
estimated total of 57,600 G418 resistant colonies on all 
35 96 plates) . The approximately 600 colonies on plate 
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number IB were trypsins end rep lat ed «t so cells/ml 
into 364 well plates (sterilin, . After one week of iL- 
batron. single colonies were visible et approximately 10 
oolon.es per large well o£ the 364 well plates (these 
plates are comprised of l 6 small wells within each of the 
24 large wells, . Eaoh well was soreened for hEPO expres 

°JZ 'I f iS ^ - *«ge wells concaineTmedia 

wrth at least 20 mO/ml hEPO. Hell number *> wss found to 
concern 15 colonies distributed among the 16 small wells 
The contents of each of these small „ el ls were trypsinized 
and transferred to „ individual wells of a S6 weTplat. 
follows 7 days of incubation the media from each of 
these wells wss sampled for hEPO ELISA analysis. Only a 
smgle well, well number 10. contained hEPO. This cell 
stram was designated HT1S5-18A2-10 and was expanded in 

for quantitative hEPO analysis, eh* isola ti 0 » and 
D«* rsolatron. Quantitative measurement of hEPO produo- 

ctZZ ^ ln ' ^ ° £ 2 ' 5 °° ^"-its/million 
ceiis/24 hours. 

A °* 2 * Pr ° be exteadi »9 from the AccI site in 
hEPO exon 5 to the Bgln slte in the 3 , untranslated 
reg.cn was used to probe RNA isolated from HT16S-18A2-10 
thi 1' r targeting ^struct, pXEP0-13, truncated at 
the Accl sxte in exon 5 dees not contain these Accl/Bgln 

tlTZT^' theref ° re ' iS dia9nOStic for targeting at 

IT CUS - ^ 0611 StraiDS that haVe recomhined in 
a homologous manner with natural hEPO sequences would 

ZltTZT, 11250 " C ° ntainin9 86gUenCe h °- 10 ^ 8 to the 
Accl/Bglli sequences. HT165-18A2-10 was fcund to express 
an „ of the predicfced ei2e hybrid . 2ing ^ «~ 

labeled Accl/Bglll hEPO probe on Northern blots. Restric- 
tion enzyme and Southern blot analysis confirmed that the 
neo gene and mMT-I promoter were targeted to one of the 
two hEPO alleles in HT165-18A2-10 cells 
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These results demonstrate that homologous recombina- 
tion can be used to target a regulatory region to a gene 
that is normally silent in human fibroblasts, resulting i: 
the functional activation of that gene. 



5 22.1 5' CACCTAAAAT GATCTCTCTG G (SEQ ID NO 14) 

22.2 5' CGCGCCGGGT GACCACACCG GGGGCCCTAG ATCTGGTGAA 
GCTGGAGCTA CGGAGTAA {SEQ ID NO 15) 

22.3 5' TTACTCCGTA GCTCCAGCTT CACCAGATCT AGGGCCCCCG 
GTGTGGTCAC CCGGCGCG (SEQ ID NO 16) 

22.4 5' GTCTCACCGT GATATTCTCG G (SEQ ID NO 17) 

EXAMPLE 5^ PRODUCTION OF TNTPfiWT.ffg.Q r.ff ttfrc 

Gene targeting can also be used to produce a pro- 
cessed gene, devoid of introns, for transfer into yeast or 
bacteria for gene expression and in vitro protein produc- 
tion. For example, hGH can by produced in yeast by the 
approach described below. 

Two separate targeting constructs are generated. 
Targeting construct 1 (TCI) includes a retroviral LTR 
sequence, for example the LTR from the Moloney Murine 
Leukemia Virus (MoMLV) , a marker for selection in human 
cells (e.g., the neo gene from Tn5) , a marker for selec- 
tion in yeast (e.g., the yeast URA3 gene), a regulatory 
region capable of directing gene expression in yeast 
(e.g., the GAL4 promoter), and optionally, a sequence 
that, when fused to the hGH gene, will allow secretion of 
hGH from yeast cells (leader sequence) . The vector can 
also include a DNA sequence that permits retroviral pack- 
aging in human cells. The construct is organized such 
that the above sequences are flanked, on both sides, by 
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hGH genomic sequences which, upon homologous recombination 
with genomic hGH gene N sequences, will integrate the 
exogenous sequences in TCI immediately upstream of hGH 
gene N codon 1 (corresponding to amino acid position i in 
the mature, processed protein) . The order of DNA sequenc- 
es upon integration is: hGH upstream and regulatory se- 
quences, neo gene, LTR, ORA3 gene, GAL4 promoter, yeast 
leader sequence, hGH sequences including and downstream of 
ammo acid 1 of the mature protein. Targeting Construct 2 
(TC2) includes sequences sufficient for plasmid replica- 
tion in yeast (e.g., 2-micron circle or ARS sequences) a 
yeast transcriptional termination sequence, a viral LTR 
and a marker gene for selection in human cells (e.g., the 
bacterial gpt gene) . The construct is organized such that 
the above sequences are flanked on both sides by hGH 
genomic sequences which, upon homologous recombination 
with genomic hGH gene N sequences, will integrate the 
exogenous sequences in TC2 immediately downstream of the 
hGH gene N stop codon. The order of DNA sequences upon 
integration is: hGH exon 5 sequences, yeast transcription ' 
termination sequences, yeast plasmid replication sequenc- 
es, LTR, gpt gene, hGH 3' non- translated sequences. 

Linear fragments derived from TCI and TC2 are sequen- 
tially targeted to their respective positions flanking the 
hGH gene. After superinfection of these cells with helper 
retrovirus, LTR directed transcription through this region 
will result in an RNA with LTR sequences on both ends. 
Splicing of this RNA will generate a molecule in which the 
normal hGH introns are removed. Reverse transcription of 
the processed transcript will result in the accumulation 
of double-stranded DNA copies of the processed hGH fusion 
gene. DNA is isolated from the doubly- targeted, retro- 
virally-infected cells, and digested with an enzyme that 
cleaves the transcription unit once within the LTR. The 
digested material is ligated under conditions that promote 
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circularization, introduced into yeast cells, and the 
cells are subsequently exposed to selection for the URA3 
gene. Only cells which have taken up the URA3 gene 
(linked to the sequences introduced by TCI and TC2 and the 
processed hGH. gene) can grow. These cells contain a 
plasmid which will express the hGH protein upon galactose 
induction and secrete the hGH protein from cells by virtue 
of the fused yeast leader peptide sequence which is 
cleaved away upon secretion to produce the mature, biolog- 
ically active, hGH molecule. 

Expression in bacterial cells is accomplished by 
simply replacing, in TCI and TC2, the ampicillin-resis- 
tance gene from pBR322 for the yeast URA3 gene, the tac 
promoter (deBoer et al., Proc. Natl. Acad, Sci. 80:21-35 
(1983)) for the yeast GAM promoter, a bacterial leader 
sequence for the yeast leader sequence, the pBR322 origin 
of replication for the 2 -micron circle or ARS sequence, 
and a bacterial transcriptional termination (e.g., trpA 
transcription terminator; Christie, G.E. fit ai., Proc. 
Natl. Acad. Sci. 7R ah-ai (isai) ) sequence for the 
yeast transcriptional termination sequence. Similarly, 
hEPO can be expressed in yeast and bacteria by simply 
replacing the hGH targeting sequences with hEPO targeting 
sequences, such that the yeast or bacterial leader se- 
quence is positioned immediately upstream of hEPO codon 1 
(corresponding to amino acid position 1 in the mature 
processed protein) . 

EXAMPLE $. ACTIVATIO N AND AMPLIFICATION OF THE EPO GENE 
IN AN IMMORTAT.T ZED HUMAN CELL LINE 

Incorporation of a dhfr expression unit into the 
unique Hindlll site of pXEPO-13 (see Example 4} results in 
a new targeting vector capable of dual selection and 
selection of cells in which the dhfr gene is amplified. 
The single Hindlll site in pXEPO-13 defines the junction 
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of the neo gene and genomic sequence naturally residing 
upstream of the human EPO gene. Placement of a dhfr gene 
at this site provides a construct with the neo and dhfr 
genes surrounded by DNA sequence derived from the natural 
hEPO locus. Like pXEPO-13, derivatives with the dhfr gene 
inserted are useful to target to the hEPO locus by homolo- 
gous recombination. Such a construct designated pREP04, 
is represented in Figure 6. The plasmid includes exons'i- 
4 and part of exon 5 of the human EPO gene, as well as the 
Hindlll-BamHI fragment lying upstream of the hEPO coding 
region. pSVe, pTK and pmMT-I correspond to the promoters 
from the SV40 early region, the Herpes Simplex Virus (HSV) 
thymidine kinase (TK) gene and the mouse metallothionein-I 
gene, it was produced as follows: Hindlll-digested 
pXEPO-13 was purified and made blunt with the Klenow 
fragment of DNA polymerase I. To obtain a dhfr expression 
unit, the plasmid construct pP8CIS9080 (Eaton sL al. . 
Biochemistry 25:8343-8347 < 198e) , wg Rested with EcoRI 
and Sail. A 2 Kb fragment containing the dhfr expression 
unit was purified from this digest and made blunt with 
Klenow fragment of DNA polymerase I. This dhfr-containing 
fragment was then ligated to the blunted Hindlll site of 
PXEPO-13. An aliquot of this ligation was transformed 
into 1^ csli and plated on ampicillin selection plates. 
Following an overnight incubation at 37«>C, individual 
bacterial colonies were observed, picked and grown. 
Miniplasmid preparations were made from these cultures and 
the resulting DNA was then subjected to restriction enzyme 
digestion with the enzymes Bgll+Hindlll, and Sfil in order 
to determine the orientation of the inserted dhfr frag- 
ments. Plasmid DNA from one of these preparations was 
found to contain such a 2' Kb insertion of the dhfr frag- 
ment . The transcription orientation of the dhfr expres- 
sion unit in this plasmid was found to be opposite that of 
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the adjacent neo gene. This is the construct designated 
pREP04 . 

Plasmid pREP04 was used to amplify the hEPO locus in 
cells subsequent to activation of the endogenous hEPO gene 
by homologous recombination. Gene activation with this 
construct allows selection for increased DHFR expression 
by the use of the drug methotrexate (MTX) . Typically, 
increased DHFR expression would occur by an increase in 
copy number through DNA amplification. The net result 
would be co-amplification of the activated hEPO gene along 
with dhfr sequences. Co-amplification of the activated 
EPO locus should result in increased EPO expression. 

Targeting experiments were performed in HT1080 cells 
with pREP04. hEPO expressing line HTREPO-52 was isolated. 
This line was analyzed quantitatively for EPO production 
and by Southern and Northern blot. This strain was found 
to be targeted with a single copy of dhfr/neo/mMT-1 se- 
quences. Expression levels obtained under 0.8 mg/ml G418 
selection were approximately 1300 mU/million cells/day. 
Because the targeted EPO locus contained a dhfr expression 
unit, it was possible to select for increased expression 
of DHFR with the antifolate drug, MTX. This strain was 
therefore subjected to stepwise selection in 0.02, 0.05, 
0.1, 0.2 and 0.4 iM MTX. Results of initial selection of 
this strain are shown in Table 4 and Figure 7. 
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TABLE 4 



SslX Line 


MTX(nM) 


mU/ 

Million Cells/ 

24h 


52C20-5-0 


0 


1368 


52C20-5-.01 


0.01 


1744 


S2C20-5-.02 


0.02 


11643 


52C20-5-0.05 


0.05 


24449 


52-3-5-0.10 


°-l 


37019 


52-3-2-0.20 


0.2 


67867 


52-3-2-0. 4B 


0.4 


99919 



Selection with elevated levels of MTX was successful in 
increasing hEPO expression in line HTREPO-52, with a 70- 
fold increase in EPO production seen in the cell line 
resistant to 0.4 M M MTX. Confirmation of amplification of 
the hEPO locus was accomplished by Southern blot analysis 
in MTX-resistant cell lines, which revealed an approxi- 
mately 10-fold increase in the copy number of the activat- 
ed hEPO locus relative to the parental (untargeted) hEPO 
allele. 

EXAMPLE 7- PRODUCTION OF AN h*Pn pper ^ BBWR t» Y XN^gRTJON 

OF THE CMV promote* i a JJSSSBEW g£ I BE r~ 
NOMIC hEPO cxm-qm pF^™*. 

fionstrufft|on of t-arrretino nl^ j d pRBPm p . 

PREP015 was constructed by first fusing the CMV 
promoter to hGH exon 1 by PCR amplification. A 1.6 kb • 
fragment was amplified from hGH expression construct 
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pXGH308, which has the CMV promoter region beginning at 
nucleotide 546 and ending at nucleotide 2105 of Genbank 
sequence HS5MIEP fused to the hGH sequences beginning at 
nucleotide 5225 and ending at nucleotide 7322 of Genbank 
5 sequence HUMGHCSA, using oligonucleotides 20 and 35. 
Oligo 20 (35 bp, SEQ ID NO: 18), hybridized to the CMV 
promoter at -614 relative to the cap site (in Genbank 
sequence HEHCMVP1) , and included a Sail site at its 5' 
end. Oligo 35 (42 bp, SEQ ID NO: 19) , annealed to the CMV 
promoter at +966 and the adjacent hGH exon 1, and included 
the first 10 base pairs of hEPO intron 1 (containing a 
portion of the splice-donor site) and a Hindlll site at 
its 5' end. The resulting PCR fragment was digested with 
Hindlll and Sail and gel -purified. Plasmid pT163 (Example 
2) was digested with Xhol and Hindlll and the approxi- 
mately i.i kb fragment containing the neo expression unit 
was gel -purified. The 1.6 kb CMV promoter/hGH exon 
1/splice -donor site fragment and the 1.2 kb neo fragment 
were ligated together and inserted into the Hindlll site 
of pBSIISK+ (Stratagene, Inc.). The resulting intermedi- 
ate plasmid (designated pBNCHS) contained a neo expression 
unit in a transcriptional orientation opposite to that of 
the CMV promoter/hGH exon 1/splice-donor site fragment) . 
A second intermediate, pREPOSAHindlll, was constructed by 
first digesting pREP05 with Hindlll. This released two 
fragments of 1.9 kb and 8.7 kb, and the 8.7 Kb fragment 
containing EPO targeting sequences was gel purified and 
circularized by self -ligation. The resulting plasmid, 
pREPOSAHindlll, contained only non- coding genomic DNA 
sequences normally residing upstream of the hEPO gene . 
This included sequence from -5786 to -1 relative to EPO 
exon 1. The 2.8 kb fragment containing neo, the CMV 
promoter, hGH exon l, and the splice-donor site was ex-, 
cised from pBNCHS with Hindlll and gel-purified. This 
fragment was made blunt with the Klenow fragment of DNA 
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polymerase I (New England Biolabs, Inc.) and ligated to 
Bglll-digested and blunt-ended pREPOSAHindlll. Bgln cuts 
at a position -1779 bp upstream of hEPO exon 1 in 
PREPOSAHindlll. The resulting construct, pREPOlS (Figure 
5 8) , contained EPO upstream sequences from -5786 to -1779 
relative to the hEPO coding region, the neo expression 
unit, the CMV promoter, hGH exon 1, a splice-donor site, 
and sequences from -1778 to -1 bp upstream of the hEPO 
coding region, with the various elements assembled, in the 
10 order listed, 5- to 3' relative to nucleotide sequence of 
the hEPO upstream region. For transfection of human 
cells, pREPOlS was digested with Not I and Pvul to liber- 
ate an 8.6 kb targeting fragment. The targeting fragment 
contained first and second targeting sequences of 4.0 kb 
and 1.8 kb, respectively, with homology to DNA upstream of 
the hEPO gene. 
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Cell entire t ranqfpetinri , gad ^ ntjfH „ aM rm „ f ^ 
expressing t"~Teted ff1onoB . 

All cells were maintained at 37»C, 5% C0 2 and 98* 
humidity in DMEM containing io% calf serum (DMEM/10, 
Hycione Laboratories) . Transfection of secondary human 
foreskin fibroblasts was performed by electroporating 12 x 
10 cells in PBS (GIBCO) with 100 fig of DNA at 250 volts 
and 960 fiF. The treated cells were seeded at 1 x 10* 
25 cells per 150 mm plate. The following day, the media was 
changed to DMEM/10 containing 0.8 mg/ml G418 (GIBCO). 
Selection proceeded for 14 days, at which time the media 
was sampled for EPO production. All colonies on plates 
exhxbiting significant hEPO levels {> 5 mU/ml) as deter- 
30 mined by an EPO ELISA (Genzyme Inc.) were isolated with 
sterile glass cloning cylinders (Bellco) and transferred 
to individual wells of a 96 well plate. Following incuba- 
tion for 1-2 days, these wells were sampled for hEPO pro- 
duction by ELISA. Resulting hEPO-producing cell strains 
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were expanded in culture for freezing, nucleic acid isola- 
tion, and quantification of EPO production. 

Transfection of HT1080 cells (ATCC CCL 121) was 
performed by treating I2x 10« cells in PBS (GIBCO) with 45 
5 fig of DNA at 450 volts and 250 fiF. Growth and identifica- 
tion of clones occurred as for secondary human foreskin 
fibroblasts described above. Isolation of hEPO producing 
clonal cell lines occurred by limiting dilution. This was 
performed by first plating colonies harvested from the 
10 initial selection plates in pools of 10-15 colonies per 
well of a 24 well plate. hEPO producing pools were then 
plated at cell densities resulting in < 1 colony per well 
of a 96 well plate. Individual clones were expanded for 
further analysis as described for human foreskin fibro- 
15 blasts above. 

Characterization of epo e *rp-e SS j ng e i 

PREP015 is devoid of any hEPO coding sequence. Upon 
targeting of the neo/CMV promoter/hGH exon 1/splice-donor 
fragment upstream of hEPO exon 1, hEPO expression occurs 

20 by transcriptional initiation from the CMV promoter, 

producing a primary transcript that includes CMV sequenc- 
es, hGH exon 1 and the splice-donor site, 1.8 kb of up- 
stream hEPO sequences, and the normal hEPO exons, introns, 
and 3' untranslated sequences. Splicing of this tran- 

25 script would occur from the splice-donor site adjacent to 
hGH exon l to the next downstream splice -acceptor site, 
which is located adjacent to hEPO exon 2. Effectively, 
this results in a new intron consisting of genomic se- 
quence upstream of the hEPO gene, the normal hEPO promot- 

30 er, hEPO exon 1, and hEPO intron 1. In the mature tran- 
script, hGH exon 1 would replace hEPO exon 1. hEPO exon 1 
encodes only the first four and one -third amino acids of 
the 26 amino acid signal peptide, which is cleaved off of 
the precursor protein prior to secretion from the cell. 
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hGH exon 1 encodes the first three and one-third amino 
acids of the hGH signal peptide, which also is cleaved off 
of the precursor protein prior to secretion from the cell 
Translation of the message in which hGH exon 1 replaces 
5 hEPO exon 1 would therefore result in a protein in which 
the signal peptide is a chimera of hGH and hEPO sequence 
Removal .of the signal peptide by the normal post-transla- 
tional cleavage event will produce a mature hEPO molecule 
whose primary sequence is indistinguishable from the 
10 normal product. 

Transfection of pREPOlS into human fibroblasts re- 
sulted in EPO expression by these cells. Table 5 shows 
the. results of targeting experiments with pREPOlS in human 
fibroblasts and HT1080 cells. The targeting frequency in 
15 normal human fibroblasts was found to be 1/264 G418* 
colonies, and the targeting frequency with HT1080 cells 
was found to be 1/450 0418* colonies. hEPO production 
levels from each of these cell strains was quantified. An 
hEPO producer obtained from transfection of human fibro- 
blasts was found to be secreting 7,679 mU/ 10 s cells/ day 
(Table 5) . An activated hEPO cell line from HT1080 cells 
was producing 12,582 mU/io« cells/ day (Table 5). These 
results indicated that activation of the hEPO locus was 
efficient and caused hEPO to be produced constituitively 
5 at relatively high levels. Restriction enzyme and South- 
ern hybridization analysis was used to confirm that tar- 
geting events had occurred at the EPO locus. 

Southern blot analysis of the human fibroblast and 
HT1080 clones that were targeted with pREPOlS was per- 
formed. Figure 9A shows the restriction map of the 
parental and targeted hEPO locus, and Figure 9B shows the 
results of restriction enzyme and Southern hybridization 
analysis of a targeted human fibroblast clone. 
Bglll/EcoRI and BamHI digests revealed 5.9 and 6.6 kb 
fragments, respectively, as a result of a targeting event 
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at the hEPO locus (lanes Tl) . Both of these fragments 
resulted from the insertion of 2.7 kb of DNA containing 
the neo gene and CMV promoter sequences. Since only one 
of the two hEPO alleles were targeted, fragments of 4.3 kb 
5 (Bglll/EcoRI) or 10.6 kb (BamHI) reflecting the unaltered 
hEPO locus were seen in these strains and in parental DNA 
(lanes HP) . These results confirm that a homologous 
recombination event had occurred at the hEPO locus result- 
ing in the production of a novel transcription unit which 
10 directed the production of human erythropoietin. 

OliaonucleQtHHA Sequence 

5' TTTTCTCGAG TCGACGACAT TGATTATTGA CTAGT 

(SEQ ID NO: 18) 
5'TTTTAAGCTT GAGTACTCAC CTGTAGCCAT GGTGGATCCC GT 
(SEQ ID NO: 19) 
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Transfection of pREPOlS and Activation of hEPO 
Expression in Human Cells 



Cell Type 
Transa- 
cted 


Cells 
Treated 


•G418* 
Colonies 


b Plates 

With BPO jExpressors 


hEPO 

Expreasors 
per Treat- 
ed Cell 


L hZ9Q 

Expression 
(mU/10* 
cells/24 
hr) 


Human 
Fibro- 
blasts 


3.3 x 10 7 


264 


1 1/264 


1/3.3 x 10 7 


7679 


HT1080 
Cells 


3.1 x 10 7 


2700 


6 jl/450 


1/5.2 x 10*1112,582 



* estimated by counting colonies on 2 plates, averaeino the 
results and extrapolating to the total number If gates 

b rlT f r ? m P lates with G418 r colonies was sampled for EPO 

f ^nlf 1317818 S hose e ^iting hEPO levels greater than 
5 mU/ml were counted as EPO activation events greacer than 

C f?Soi? = t ^ Ve . hE ?° P roduc tion was determined from human 
^PolK-f-P 111 ' HF . 342 - 15 or HT 10 " cell line, 



EXAMPLE R ? 



PROPPCTION AND AMPT.TT7 T ^ TIOW op aw ^ EpQ m!iTnia 
QENE BY INSERT TOW nv n nm QMV promoter t , g p p 
UPSTREAM OF THE ttBWnMT Q hEPO mnTHP. p P «t^t 



. Construction Of targeting P 1 P , S mid nPttPma. 

PREP018 (Figure 10) was constructed by insertion of a 
dhfr expression unit at the Clal site located at the S' 
end of the neo gene of pREPOlS. To obtain a dhfr ex- 
pression unit, the plasmid construct pF8CIS9080 [Eaton sL 
aU, Biochemistry £5: 8343-8347 (1986)] was digested with 
EcoRI and Sail. A 2 kb fragment containing the dhfr 
expression unit was purified from this digest and made 
blunt by treatment with the Klenow fragment of DNA poly- 
merase I. A Clal linker (New England Biolabs) was then 
ligated to the blunted dhfr fragment. The products of 
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this ligation were then digested with Clal ligated to Clal 
digested pREP015 . An aliquot of this ligation was trans- 
formed into E. coli and plated on ampicillin selection 
plates. Bacterial colonies were analyzed by restriction 
5 enzyme digestion to determine the orientation of the 
inserted dhfr fragment. One plasmid with dhfr in a 
transcriptional orientation opposite that of the neo gene 
was designated pREP018(-) . A second plasmid with dhfr in 
the same transcriptional orientation as that of the aeo 
10 gene was designated pREP018 (+) 

Cell culture, fransf ection , and identic c ation of BPn 
expressing targeted clones; 

All cells were maintained at 37°C, 5% CO a , and 98* 
humidity in DMEM containing 10* calf serum (DMEM/10, 
15 HyClone Laboratories) . Transfection of HT1080 cells 

(ATCC, CCL 121) occurred by treating 12x 10* cells in PBS 
(GIBCO) with 45 /tg of DMA at 450 volts and 250 /*F. The 
treated cells were seeded at 1 x 10 s cells per 150 mm 
plate. The following day, the media was changed to 

20 DMEM/10 containing 0.8 mg/ml G418 (GIBCO). Selection 

proceeded for 14 days, at which time the media was sampled 
for hEPO production. Plates exhibiting significant hEPO 
production levels (> 5 mU/ml) as determined by an hEPO 
EL ISA (Genzyme Inc.) were trypsinized and the cells were 

25 re-plated for clone isolation. Isolation of hEPO produc- 
ing clonal cell lines occurred by limiting dilution, by 
first plating clones in pools of 10-15 colonies per well 
of a 24 well plate, and next plating cells from hEPO pro- 
ducing pools at cell densities resulting in less than 1 

30 colony per well of a 96 well plate. Individual clones 

were expanded in culture for freezing, nucleic acid isola- 
tion and quantification of hEPO production. 
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10 



Illation of ce1 1 s rontaining amplify dhfr ^ ^ ^ 

methotrexate B^Wf-i^p. 

Targeted 6418' cell lines producing hEPO following 
transfection with pREPOlS were plated at various cell 
densities for selection in methotrexate (MTX) . As new 
clones emerged following selection at one MTX concentra- 
tion, they were assayed for hEPO production and re-plated 
at various cell densities in a higher concentration of MTX 
(usually double the previous concentration) . This process 
was repeated until the desired hEPO production level was 
reached. At each step of MTX-resistance, DNA and RNA was 
isolated for respective southern and northern blot analy- 
sis. * 



15 



D 



. Characterization of epo P ^^i na rinnp B . 

pREPOlS, with two different orientations of dhfr, was 
transfected into HT1080 cells. Prior to transfection 
PREPOlS ( + ) and P REP018(-) were digested with Xbal, releas- 
ing a 7.9 kb targeting fragment containing, in the follow- 
ing order, a 2.1 kb region of genomic DNA upstream of hEPO 
exon 1 (from -3891 to -1779 relative to the hEPO ATG start 
codon), a 2 kb region containing the dhfr gene, a 1.1 kb 
region containing the neo gene, a 1.5 kb region contain- 
ing the CMV promoter fused to hGH exon 1, 10 bp of hEPO 
intron 1 (containing a splice-donor site) , followed by a 
1.1 kb region of genomic DNA upstream of hEPO exon 1 (from 
-1778 to -678 relative to the EPO ATO start codon) . 
Transfection and targeting frequencies from two experi- 
ments are shown in Table 6. Five primary 6418' clones 
were isolated from these experiments. These were expanded 
in culture for quantitative analysis of hEPO expression 
(Table 7) . As pREPOlS contained the dhfr gene, it is 
possible to select for cells containing amplified copies 
of the targeting construct using MTX as described-in 
Example 6. 6418' clones confirmed to be targeted to the 
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hEPO locus by restriction enzyme and Southern hybridiza- 
tion analysis were subjected to stepwise selection in MTX 
as described. 



Table 6: Targeting of pREP018 in HT1080 cells 



Construct 


DNA 
Digest 


Cells 
Treated 


G4ie r 

Colonies 


Plates 
With hEPO 
Bxpressors 


hEPO 

Expressora- 

/G418 r 

Colony 


Primary 

Clones 

Analyzed 


PREP018 

<-) 


Xbal 


36 x 10 k 


16,960 


39 


1/435 


1 


pREPOie 
M 


Xbal 


36 x 10> 


19,290 


41 


lMo 


4 



Table 7. hEPO production in HT1080 Cell lines targeted with 
pREPC-18 



Cell Line 


Construct 


IhEPO au/ld r 
Cells/24 hr 


18B3-147 


PREP018 (+) 




18B3-181 


pREPOlB U) 


20831 


18B3-145 


PREP018 (+) 


17586 


18B3-168 


PREP018 (+) 


5293 


18A3-119 


pREP018(-) 


2881 
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EXAMPLE 9; ACTIVATTON AND AMPT.TPTr^T OH np ^n^nnc 

y-iNTgRF^oM gm-^f, n-rgp and ESHfi g 

IMMORTAT.Tprpn piMAN rgT r T r g 
A wide variety of endogenous cellular genes can be 
activated and amplified using the methods and DNA con- 
structs of the. invention. The following describes a 
general strategy for activating and amplifying the human 
a-xnterferon (leukocyte interferon), GM-CSP (colony stimu- 
lating factor-granulocyte/macrophage) , G-CSF (colony 
stimulating factor-granulocyte) and FSH0 (follicle stimu- 
lating hormone beta subunit) genes. 

ff- interferon 

The human or-interferon gene (Genbank sequence 
HDMIFNAA) encodes a 188 amino acid precursor protein 
containing a 23 amino acid signal peptide. The gene 
contains no introns. Figure 11 schematically illustrates 
one strategy for activating the ^-interferon gene. The 
targeting construct is designed to include a first target- 
ing sequence homologous to sequences upstream of the gene 
an araplifiable marker gene, a selectable marker gene, a 
regulatory region, a CAP site, a splice-donor site, an • 
intron, a splice acceptor site, and a second targeting 
sequence corresponding to sequences downstream of the 
first targeting sequence. The second targeting sequence 
should not extend further upstream than to position -107 
relative to the normal start codon in order to avoid 
undesired ATG start codons. 

In this strategy the first and second targeting 
sequences are immediately adjacent to each other in the 
normal target gene, but this is not required (see below) . 
Amplifiable marker genes and selectable marker genes 
suitable for selection are described herein. The amplifi- 
able marker gene and selectable marker gene may be the 
same gene, their positions may be reversed, and one or 
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both may be situated in the intron of the targeting con- 
struct. A selectable marker gene is optional and the 
amplifiable marker gene is only required when amplifica- 
tion is desired. The incorporation of a specific CAP site 
5 is optional. Optionally, exon sequences from another gene 
can be included 3' to the splice-acceptor site and 5' to 
the second targeting sequence in the targeting construct. 
The regulatory region, CAP site, splice-donor site, 
intron, and splice acceptor site can be isolated as a 
10 complete unit from the human elongation factor-la (EF-la; 
Genbank sequence HUMEF1A) gene or the cytomegalovirus 
(CMV; Genbank sequence HEHCMVPl) immediate early region, 
or the components can be assembled from appropriate compo- 
nents isolated from different genes. 

Genomic DNA corresponding to the upstream region of 
the a- interferon gene for use as targeting sequences and 
assembly of the targeting construct can be performed using 
recombinant DNA methods known by those skilled in the art. 
As described herein, a number of selectable and amplifi- 
able markers can be used in the targeting constructs, and 
the activation and amplification can be effected in a 
large number of cell -types. Transfection of primary, 
secondary, or immortalized human cells and isolation of 
homologously recombinant cells expressing a-interferon can 
be accomplished using the methods described in Example 4, 
using an ELISA assay for human a-interferon (Biosource 
International, Camarillo, CA) . Alternatively, homo- 
logously recombinant cells may be identified by PCR 
screening as described in Example lg and lj. The isola- 
tion of cells containing amplified copies of the amplifi- 
able marker gene and the activated a-interferon locus is 
performed as described in Example 6. 

In the homologously recombinant cells, an mRNA pre- 
cursor is produced which includes the exogenous exon, 
splice-donor site, intron, splice -acceptor site, second 
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targeting sequence, and human a-interferon coding region 
and 3' untranslated sequences (Figure 11). Splicing of 
thxs message will generate a functional mRHA which can be 
translated to produce human a-interferon. 
5 The size of the intron and thus the position of the 

regulatory region relative to the coding region of the 
gene may be varied to optimize the function of the regula- 
tory region. Multiple exons may be present in the target- 
ing construct. In addition, the second targeting sequence 
does not need to lie immediately adjacent to or near the 
first targeting sequence in the normal gene, such that 
portions of the gene's normal upstream region are deleted 
upon homologous recombination. 

GM-CSF 

The human GM-CSF gene (Genbank sequence HDMGMCSFG) 
encodes a 144 amino acid precursor protein containing a 17 
amxno acid signal peptide. The gene contains four exons 
and three introns, and the N-terminal SO amino acids of 
the precursor are encoded in the first exon. Figure 12 
schematically illustrates a strategy for activating the 
GM-CSF gene, m this strategy the targeting construct is 
desxgned to include a first targeting sequence homologous 
to sequences upstream of the gene, an amplifiable marker 
gene, a selectable marker gene, a regulatory region, a CAP 
sxte, an exon which encodes an amino acid sequence which 
xs identical or functionally equivalent to that of the 
fxrst 50 amino acids of GM-CSF, a splice-donor site, and a 
second targeting sequence corresponding to sequences 
downstream of the first targeting sequence. By this 
strategy, homologously recombinant cells produce an mRNA 
precursor which corresponds to the exogenous exon and 
splxce-donor site, the second targeting sequence, any " 
sequences between the second targeting sequence and the 
start codon of the GM-CSF gene, and the exons, introns. 
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1 

and 3' untranslated region of the GM-CSF gene (Figure 11) . 
Splicing of this message results in the fusion of the 
exogenous exon to exon 2 of the endogenous GM-CSF gene 
which, when translated, will produce GM-CSF. 
5 In this strategy the first and second targeting 

sequences are .immediately adjacent in the normal target 
gene, but this is not required (see below) . Amplifiable 
marker genes and selectable marker genes suitable for 
selection are described herein- The amplifiable marker 
10 gene and selectable marker gene can be the same gene or 

their positions can be reversed. A selectable marker gene 
is optional and the amplifiable marker gene is only re- 
quired when amplification is desired. The selectable 
marker and/or amplifiable marker can be positioned between 
15 the splice-donor site and the second targeting sequence in 
the targeting construct. The incorporation of a specific 
CAP site is optional. The regulatory region, CAP site, 
and splice-donor site can be isolated as a complete unit 
from the human elongation factor- la (EF-la; Genbank se- 
quence HUMEF1A) gene or the cytomegalovirus (CMV; Genbank 
sequence HEHCMVPl) immediate early region, or the compo- 
nents can be assembled from an appropriate component 
isolated from different genes (such as the mMT-I promoter 
and CAP site, and exon 1 and a splice donor site from the 
hGH or hEPO genes. 

Other approaches can be employed, for example, the 
first and second targeting sequences can correspond to 
sequences in the first intron of the GM-CSF gene. Alter- 
natively, a targeting construct similar to that described 
for the a-interferon can be used, in which the targeting 
construct is designed to include a first targeting se- 
quence homologous to sequences upstream of the GM-CSF 
gene, an amplifiable marker gene, a selectable marker 
gene, a regulatory region, a CAP site, a splice-donor 
site, an intron, a splice acceptor site, and a second 
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targeting sequence corresponding to sequences downstream 
of the first targeting sequence. 

In any case the second targeting sequence does not 
need to lie immediately adjacent to or near the first 
targeting sequence in the normal gene, such that portions 
of the gene's normal upstream region are deleted upon 
homologous recombination. i„ addition, multiple 
non-coding or coding exons can be present in the targeting 
construct. Genomic DNA corresponding to the upstream or 
intron regions of the human GM-CSP gene for use as target- 
ing sequences and assembly of the targeting construct can 
be performed using recombinant DNA methods known by those 
skilled in the art. As described herein, a number of 
selectable and amplifiable markers can be used in the 
targeting constructs, and the activation can be effected 
in a large number of cell-types. Transfection of primary 
secondary, or immortalized human cells and isolation of 
homologously recombinant cells expressing GM-CSP can be 
accomplished using the methods described in Example 4 
using an ELISA assay for human GM-CSP (R&D Systems, Minne- 
apolis, MS) . Alternatively, homologously recombinant 
cells may be identified by pgr screening as described 
above. The isolation of cells containing amplified copies 
of the amplifiable marker gene and the activated GM-CSP 
25 locus is performed as described above. 

G-CSF 

The human G-CSF gene (Genbank sequence HDMGCSFG) 
encodes 204-207 amino acid precursor protein containing a 
30 amino acid signal peptide. The gene contains five 
0 exons and four introns. The first exon encodes 13 amino 
acids of the signal peptide. Figure 13 schematically 
illustrates a strategy for activating the G-CSP gene. The 
targeting construct is designed to include a first target- 
ing sequence homologous to sequences upstream of the gene 
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an amplifiable marker gene, a selectable marker gene, a 
regulatory region, a CAP site, an exon which encodes ^n 
amino acid sequence which is identical or functionally 
equivalent to that of the first 13 amino acids of the 
5 G-CSF signal peptide, a splice- donor site, and a second 
targeting sequence corresponding to sequences downstream 
of the first targeting sequence. By this strategy, homo- 
logously recombinant cells produce an mRNA precursor which 
corresponds to the exogenous exon and splice -donor site, 
10 the second targeting sequence, any sequences between the 
second targeting sequence and the start codon of the G-CSF 
gene, and the exons, introns, and 3' untranslated region 
of the G-CSF gene (Figure 13). Splicing of this message 
results in the fusion of the exogenous exon to exon 2 of 
the endogenous G-CSF gene which, when translated, will 
produce G-CSF- The ability to functionally substitute the 
first 13 amino acids of the normal G-CSF signal peptide 
with those present in the exogenous exon allows one to 
make modifications in the signal peptide, and hence the 
secretory properties of the protein produced. 

In this strategy the first and second targeting 
sequences are immediately adjacent in the normal target 
gene, but this is not required. The second targeting 
sequence does not need to lie immediately adjacent to or 
near the first targeting sequence in the normal gene, such 
that portions of the gene's normal upstream region are 
deleted upon homologous recombination. The amplifiable 
marker gene and selectable marker gene can be the same 
gene or their positions can be reversed. A selectable 
marker gene is optional and the amplifiable marker gene is 
only required when amplification is desired. The select- 
able marker and/or amplifiable marker can be positioned 
between the splice-donor site and the second targeting 
sequence in the targeting construct. The incorporation of 
a specific CAP site is optional. The regulatory region, 
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CAP site, and splice-donor site can be isolated as a 
complete unit from the human elongation factor-la (EF-la • 
Genbank sequence HUMEF1A) gene or the cytomegalovirus 
(CMV; Genbank sequence HEHCMVPi) immediate early region 
or the components can be assembled from an appropriate ' 
component isolated from different genes (such as the mMT-I 
promoter and CAP site, and exon 1 and a splice donor site 
from the hGH or EPO genes. Multiple exogenous exons, 
coding or non-coding, can be used in the targeting con- 
struct so long as an ATG start codon which, upon splicing, 
will be in-frame with the mature protein, is included in 
one of the exons. 

Other approaches may be employed, for example, the 
first and second targeting sequences can correspond to 
sequences in the first intron of the G-CSF gene. Alterna- 
tively, a targeting construct similar to that described 
for the ^-interferon can be used, in which the targeting 
construct is designed to include a first targeting se- 
quence homologous to sequences upstream of the G-CSF gene 
20 an amplifiable marker gene, a selectable marker gene, a 
regulatory region, a CAP site, a splice-donor site, an 
xntron, a splice acceptor site, and a second targeting 
sequence corresponding to sequences downstream of the 
first targeting sequence. 

Genomic DNA corresponding to the upstream or intron 
regions of the human G-CSF gene for use as targeting 
sequences and assembly of the targeting construct can be 
performed using recombinant DNA methods known by those 
skilled in the art. As described herein, a number of 
selectable and amplifiable markers can be used in the 
targeting constructs, and the activation can be effected 
in a large number of cell-types. Transfection of primary 
secondary, or immortalized human cells and isolation of 
homologously recombinant cells expressing G-CSF can be 
accomplished using the methods described in Example 4, 
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using an ELISA assay for human G-CSF (R&D Systems, Minne- 
apolis, MN) . Alternatively, homologously recombinant 
cells may be identified by PCR screening as described 
above. The isolation of cells containing amplified copies 
5 of the amplifiable marker gene and the activated 
a-interferon locus is performed as described above. 

FSHfi 

The human FSH/3 gene (Genbank sequence HDMFSH1) en- 
codes a 129 amino acid precursor protein containing a 16 
amino acid signal peptide. The gene contains three exons 
and two introns, with the first exon being a non-coding 
exon. The activation of FSH/S can be accomplished by a 
number of strategies. One strategy is shown in Figure 14. 
In this strategy, a targeting construct is designed to 
include a first targeting sequence homologous to sequences 
upstream of the gene, an amplifiable marker gene, a selec- 
table marker gene, a regulatory region, a CAP site, an 
exon, a splice-donor site, and a second targeting sequence 
corresponding to sequences downstream of the first target- 
ing sequence. By this strategy, homologously recombinant 
cells produce an mRNA precursor which corresponds to the 
exogenous exon and splice-donor site, the second targeting 
sequence, any sequences between the second targeting 
sequence and the start codon of the FSH0 gene, and the 
exons, introns, and 3' untranslated regions of the FSHjS 
gene (Figure 14) . Splicing of this message results in the 
fusion of the exogenous exon to exon 2 of the endogenous 
FSH0 gene which, when translated, can produce FSHjS. In 
this strategy the first and second targeting sequences are 
immediately adjacent in the normal target gene, but this 
is not required (see below) . 

Other approaches can be employed, for example, the . 
first and second targeting sequences can correspond to 
sequences in the first intron of the FSH/3 gene. Alterna- 
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tively, a targeting construct similar to that described 
for the a-interferon can be used, m this strategy the 
targeting construct is designed to include a first target- 
lag sequence homologous to sequences upstream of the FSHB 
5 gene, an amplifiable marker gene, a selectable marker 
gene, a regulatory region, a CAP site, a splice-donor 
s^e, an intron, a splice acceptor site, and a second 
targeting sequence corresponding to sequences downstream 
of the first targeting sequence. The second targeting 
10 sequence should not extend further upstream than to posi- 
tion -40 relative to the normal FSH/J transcriptional start 
site an order to avoid undesired ATO start codons. m the 
homologously recombinant cells, an mRNA precursor is 
produced which includes the exogenous exon, splice-donor 
site, intron, splice-acceptor site, second targeting 
sequence, and human FSH/J coding exons, intron and 3' un- 
translated sequences. Splicing of this message will 
generate a functional mRNA which can be translated to 
produce human PSH/J. The size of the intron and thus the 
position of the regulatory region relative to the coding 
region of the gene can be varied to optimize the function 
of the regulatory region. 

In any activation strategy, the second targeting 
sequence does not need to lie immediately adjacent to or 
near the first targeting sequence in the normal gene, such 
that portions of the gene's normal upstream region are 
deleted upon homologous recombination. Furthermore, one 
targeting sequence can be upstream of the gene and one may 
be within an exon or intron of the FSH/J gene. 
30 The amplifiable marker gene and selectable marker 

gene can be the same gene, their positions can be 
reversed, and one or both can be situated in the intron of 
the targeting construct. Amplifiable marker genes and 
selectable marker genes suitable for selection are de- 
scribed herein. A selectable marker gene is optional and 
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the amplifiable marker gene is only required when amplifi- 
cation is desired. The incorporation of a specific CAP 
site is optional. Optionally, exon sequences from another 
gene can be included 3' to the splice-acceptor site and 5' 
to the second targeting sequence in the targeting con- 
struct. The regulatory region, CAP site, exon, 
splice-donor site, intron, and splice acceptor site can be 
isolated as a complete unit from the human elongation 
factor-la (EF-la; Genbank sequence HUMEF1A) gene or the 
cytomegalovirus (CMV; Genbank sequence HEHCMVP1) immediate 
early region, or the components can be assembled from 
appropriate components isolated from different genes. In 
any. case, the exogenous exon can be the same or different 
from the first exon of the normal FSH/3 gene, and multiple 
exons can be present in the targeting construct. 

Genomic DNA corresponding to the upstream region of 
the FSH0 gene for use as targeting sequences and assembly 
of the targeting construct can be performed using recombi- 
nant DNA methods known by those skilled in the art. As 
described herein, a number of selectable and amplifiable 
markers can be used in the targeting constructs, and the 
activation can be effected in a large number of 
cell-types. If desirable, the product of the activated 
FSH0 gene can be produced in a cell type that expresses 
the human glycoprotein a-subunit, the product of which 
forms a heterodimer with the product of the FSH0 gene. 
This may be a naturally occurring cell strain or line. 
Alternatively, the human glycoprotein a-subunit gene 
(Genbank sequence HUMGLYCA1) can be co-expressed with the 
product of the FSH/5 gene, with such co-expression accom- 
plished by expression of the human glycoprotein a-subunit 
gene or cDNA under the control of a suitable promoter, or 
by activation of the human glycoprotein a-subunit gene . 
through the methods described herein. Transfection of 
primary, secondary, or immortalized human cells and isola- 
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txon of homologously recombinant cells expressing FSH0 can 
be accompli shed using the methods described above using an 
ELISA assay for human FSH/J (Accurate Chemical and Scien- 
tific Westbury, NY) . Alternatively, homologously recom- 
binant cells may be identified by pgr screening as de- 
scribed above, The isolation of cells containing ampli- 
fied copies of the amplifiable marker gene and the acti- 
vated ^-interferon locus is performed as described above. 

Those skilled in the art will recognize, or be able 
to ascertain using not more than routine experimentation, 
n»any equivalents to the specific embodiments of the inven- 
tion described herein. Such equivalents are intended to 
be encompassed by the following claims. 
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CLAIMS 

1. A DNA construct capable of altering the expression of 
a targeted gene when inserted into chromosonal DNA of 
a cell comprising; 

(a) a targeting sequence; 

(b) a regulatory sequence; 

(c) an exon; and 

(d) an impaired splice -donor site. 

2. The DNA construct of Claim 1 wherein the exon com- 
prises a CAP site. 

3. The DNA construct of Claim 2 wherein the exon further 
comprises the nucleotide sequence ATG. 

4. The DNA construct of Claim 3 wherein the exon further 
comprises encoding DNA which is in- frame with the 
targeted gene. 

5. The DNA construct of Claim 4 wherein the encoding DNA 
of the exon is the same as the encoding DNA of the 
first exon of the targeted gene. 

6. The DNA construct of Claim 4 wherein the encoding DNA 
of the exon is different from the encoding DNA of the 
first exon of the targeted gene. 

7. The DNA construct of Claim 4 wherein the targeting 
sequence is homologous to a sequence within the 
targeted gene. 

8 . The DNA construct of Claim 4 wherein the targeting 
sequence is homologous to a sequence upstream of the 
coding region of the targeted gene. 
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10. 



11. 



The DNA construct of Claim 4 wherein the targeting 
sequence is homologous to a sequence upstream of the 
endogenous regulatory sequence of the targeted gene. 

The DNA construct of claim 4 wherein the construct 
further comprises a second targeting sequence homolo- 
gous to a sequence within the targeted gene. 

The DNA construct of claim 4 wherein the construct 
further comprises a second targeting sequence homolo- 
gous to a sequence upstream of the coding region of 
the taropforf ru»,~ 



10 the targeted gene 

12. 



15 



20 



25 



The DNA construct of Claim 4 wherein the construct 
further comprises a second targeting sequence homolo- 
gous to a sequence upstream of the endogenous regula- 
tory sequence of the targeted gene. 

13. The DNA construct of Claim 4 wherein the targeted 
gene encodes a therapeutic protein. 

14. The DNA construct of Claim 4 wherein the targeted 
gene encodes a hormone, a cytokine, an antigen, an 
antxbody, an enzyme, a clotting factor, a transport 
protein, a receptor, a regulatory protein, a struc- 
tural protein or a transcription factor. 

15. The DNA construct of Claim 4 wherein the targeted 
gene encodes a protein selected from the group con- 
sistxng of erythropoietin, calcitonin, growth hor- 
mone, insulin, insulinotropin, insulin-like .growth 
factors, parathyroid 'hormone, ^-interferon, Y -inter- 
feron, nerve growth factors, FSH0, TGF-/J, tumor • 
necrosis factor, glucagon, bone growth factor-2, bone 
growth factor-7, TSH-/?, interleukin 1, interleukin 2, 
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interleukin 3,' interleukin 6, interleukin 11, inter- 
leukin 12, CSF-granulocyte, CSF- macrophage, CSF- 
granulocyte/ macrophage, immunoglobulins, catalytic 
antibodies, protein kinase C, glucocerebrosidase, 
superoxide dismutase, tissue plasminogen activator, 
urokinase, antithrombin III, DNAse, a-galactosidase, 
tyrosine hydroxylase, blood clotting factor V, blood 
clotting factor VII, blood clotting factor VIII, 
blood clotting factor IX, blood clotting factor X, 
blood clotting factor XIII, apolipoprotein E or 
apolipoprotein A- 1, globins, low density lipoprotein 
receptor, IL-2 receptor, IL-2 antagonists, alpha-1 
antitrypsin, immune response modifiers, and soluble 
CD4. 

The DNA construct of Claim 15 wherein the targeted 
gene encodes growth hormone, FSH/8, G-CSF or GM-CSF. 

The DMA construct of Claim 15 wherein the targeted 
gene encodes erythropoietin. 

The DNA construct of Claim 17 wherein the encoding 
DNA of the exon is the same as the encoding DNA of 
the first exon of erythropoietin. 

The DNA construct of Claim 17 wherein the encoding 
DNA of the exon is different from the encoding DNA of 
the first exon of erythropoietin. 

The DNA construct of Claim 19 wherein the encoding 
DNA of the exon is the same as the encoding DNA of 
the first exon of human growth hormone. 
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22. 



23 



0 

25 
26 



The DNA construct of Claim 1 wherein the regulatory 
sequence is a promoter, an enhancer, a scaffold- 
attachment region or a transcription factor binding 

The DNA construct of Claim 21 wherein the regulatory 
sequence is a promoter. 

The DNA construct of Claim 22 further comprising an 
additional regulatory sequence. 

24. The DNA construct of Claim 22 wherein the construct 
. further comprises an enhancer. 

The DNA construct of Claim 24 further comprising one 
or more selectable markers. 

The DNA construct of claim 25 further comprising an 
amplifiable marker gene. 

The DNA construct of Claim 21 wherein the regulatory 
sequence is a regulatory sequence of the mouse 
metallothionein-I gene, a regulatory sequence of an 
SV-40 gene, a regulatory sequence of a cytomegalo- 
virus gene, a regulatory sequence of a collagen gene, 
a regulatory sequence of an actin gene, a regulatory 
sequence of an immunoglobulin gene, a regulatory 
sequence of the HMG-CoA reductase gene or a regulato- 
ry sequence of the EF-ior gene. 



28. 



A method of making a homologously recombinant cell 
wherein the expression of a targeted gene is altered 
comprising the steps of: 

(a) transfecting a cell with a DNA construct, the 
DNA construct comprising: 
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(i) a targeting sequence; 

(ii) a regulatory sequence; 

(iii) an exon; and 

(iv) an unpaired splice-donor site, thereby 
- producing a transfected cell; and 

(b) maintaining the transfected cell under condi- 
tions appropriate for homologous recombination. 

29. The method of Claim 28 wherein the exon comprises a 
CAP site. 



10 30. The method of Claim 29 wherein the exon comprises the 
nucleotide sequence ATG. 

31. The method of Claim 30 wherein the exon further 
comprises encoding DNA in- frame with the targeted 
gene . 

32. The method of Claim 31 wherein the encoding DNA of 
the exon is the same as the encoding DNA of the first 
exon of erythropoietin. 

33. The method of Claim 31 wherein the encoding DNA of 
the exon is different from the encoding DNA of the 
first exon of erythropoietin. 

34. The method of Claim 31 wherein the targeting sequence 
is homologous to a sequence within the targeted gene. 

35. The method of Claim 31 wherein the targeting sequence 
is homologous to a sequence upstream of the coding 
region of the targeted gene. 
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36. 



37. 



The method of Claim 31 wherein the targeting sequence 
is homologous to a sequence upstream of the endoge- 
nous regulatory sequence of the targeted gene. 

The method of Claim 31 wherein the construct further 
comprises a second targeting sequence homologous to a 
sequence within the targeted gene. 



38. 



The method of Claim 31 wherein the construct further 
comprises a second targeting sequence homologous to a 
sequence upstream of the coding region of the target- 
10 ed gene. 



39. 



The method of Claim 31 wherein the construct further 
comprises a second targeting sequence homologous to a 
sequence upstream of the endogenous regulatory se- 
quence of the targeted gene. 

15 40. The method of Claim 31 wherein the cell is a human 
cell. 



41. 



42, 

20 . 



43 



25 44. 



The method of Claim 28 wherein the targeted gene 
encodes erythropoietin. 

The method of Claim 31 wherein the encoding DNA is 
the same as the encoding DNA of the first exon of 
erythropoietin. 

The method of Claim 31 wherein the encoding DNA is 
different from the encoding DNA of the first exon of 
erythropoietin. 

The method of Claim 31 wherein the encoding DNA of 
the exon is the same as the encoding DNA of the first 
exon of human growth hormone. 
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45. The method of Claim 28 further comprising the step 
of : 

(c) maintaining a homologously recombinant cell from 
step (b) under conditions appropriate for pro- 
duction of a protein. 

46. The method of Claim 45 in which the gene whose ex- 
pression is altered is the erythropoietin gene. 

47. Erythropoietin produced by the method of Claim 45. 

48. A fusion protein containing amino acids encoded by 
exons from the DNA construct and amino acids encoded 
by an endogenous gene produced by the method of Claim 
45. 

49. A fusion protein of Claim 48 wherein the endogenous 
gene is erythropoietin. 

50. A fusion protein of Claim 49 comprising amino acids 
1-3 of human growth hormone and amino acids 6-165 of 
human erythropoietin. 

51. A homologously recombinant cell produced by the 
method of Claim 28. 

52. A homologously recombinant cell produced by the 
method of Claim 29. 

53. A homologously recombinant cell produced by the 
method of Claim 30. 

54. A homologously recombinant cell produced by the 
method of Claim 31. 
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55 . A homologously recombinant cell produced by the 
method of Claim 32. 

56. A homologously recombinant cell produced by the 
method of Claim 33. 

5 57. a homologously recombinant cell produced by the 
method of Claim 40. 



58. a homologously recombinant cell produced by the 
method of Claim 41. 

59. a homologously recombinant cell produced by the 
10 method of Claim 42. 

60. A homologously recombinant cell produced by the 
method of Claim 44. 

61. A homologously recombinant cell comprising an exoge- 
nous regulatory sequence, an exogenous exon and a 
splice-donor site, operatively linked to the second 
exon of an endogenous gene. 

«. The homologously recombinant cell of Claim 61 wherein 
the exogenous exon comprises a CAP site. 

63. The homologously recombinant cell of Claim 62 wherein 
the exogenous exon further comprises the nucleotide 
sequence ATG. 



64 



The homologously recombinant cell of Claim 63 wherein 
the exogenous exon further comprises encoding DNA in- 
frame with the targeted endogenous gene 
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65. The homologously recombinant cell of Claim 64 wherein 
the encoding DNA is the same as the encoding DNA of 
the first exon of the targeted gene. 

66. The homologously recombinant cell of Claim 64 wherein 
the encoding DNA is different from the encoding DNA 
of the first exon of the targeted gene. 

67. The homologously recombinant cell of Claim 64 wherein 
the exogenous regulatory sequence, exogenous exon and 
splice-donor site are upstream of the coding region 
of the targeted gene. 

68. The homologously recombinant cell of Claim 67 wherein 
the exogenous regulatory sequence, exogenous exon and 
splice -donor site are upstream of the endogenous 
regulatory sequence of the targeted gene. 

69. The homologously recombinant cell of Claim 61 wherein 
the endogenous regulatory sequence is deleted. 

70. The homologously recombinant cell of Claim 69 wherein 
the first endogenous exon is deleted. 

71. The homologously recombinant cell of Claim 64 wherein 
the targeted gene encodes a hormone, a cytokine, an 
antigen, an antibody, an enzyme, a clotting factor, a 
transport protein, a receptor, a regulatory protein, 

a structural protein or a transcription factor. 

72. The homologously recombinant cell of Claim 64 wherein 
the targeted gene encodes a protein selected from the 
group consisting of erythropoietin, calcitonin, 
growth hormone, insulin, insulinotropin, insulin- 
like growth factors, parathyroid hormone, /9- inter- 



WO 95/31560 



PCT/US95/06045 



-117- 



10 



feron, Y-interferon, nerve growth factors, FSH0, TGF- 
fi. tumor necrosis factor, glucagon, bone growth 
factor-2, bone growth factor-7, TSH-/3, interleukin l 
interleukin 2, interleukin 3, interleukin 6 
interleukin 11, interleukin 12, CSF-granulocyte, CSF- 
macrophage, CSF-granulocyte/ macrophage, 
immunoglobulins, catalytic antibodies, protein kinase 
C glucocerebrosidase, superoxide dismutase, tissue 
plasminogen activator, urokinase, anti thrombin in 
DNAse, or-galactosidase, tyrosine hydroxylase, blood 
clotting factor V, blood clotting factor VII, blood 
clotting factor VIII. blood clotting factor IX, blood 
. dotting factor X, blood clotting factor XIII, apoli- 
poprotein E or apolipoprotein A-I, globins. low 
density lipoprotein receptor, IL-2 receptor, il-2 
antagonists, alpha-l antitrypsin, immune response 
modifiers, and soluble CD4. 

73. The homologously recombinant cell of Claim 61 wherein 
the cell is a eukaryote. 

20 74. The homologously recombinant cell of Claim 73 wherein 
the cell is of fungal, plant or animal origin. 

75. The homologously recombinant cell of Claim 74 wherein 
the cell is of vertebrate origin. 

76. The homologously recombinant cell of Claim 75 wherein 
the cell is a primary or secondary mammalian cell. 

77. The homologously recombinant cell of Claim 75 wherein 
the cell is a primary or secondary human cell. 

78. The homologously recombinant cell of claim 75 wherein 
the cell is an immortalized mammalian cell. 
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79. The homologously recombinant cell of Claim 75 wherein 
the cell is an immortalized human cell. 

80. The homologously recombinant cell of Claim 75 wherein 
the cell is selected from the group consisting of: 
HT1080 cells, HeLa cells and derivatives of HeLa 
cells, MCF-7 breast cancer cells, K-562 leukemia 
cells, KB carcinoma cells, 2780AD ovarian carcinoma 
cells, Raji cells, Jurkat cells, Namalwa cells, HL-60 
cells, Daudi cells, RPM1 8226 cells, U-937 cells, 
Bowes Melanoma cells, WI-38VA13 subline 2R4 cells, 
and MOLT- 4 cells, 

L. The homologously recombinant cell of Claim 80 wherein 
the targeted gene encodes erythropoietin. 

• • The homologously recombinant cell of Claim 81 capable 
of expressing erythropoietin. 

i. The homologously recombinant cell of Claim 82 wherein 
the encoding DNA is the same as the encoding DNA of 
the first exon of erythropoietin. 

. The homologously recombinant cell of Claim 81 wherein 
the encoding DNA is different from the encoding DNA 
of the first exon of erythropoietin. 

. The homologously recombinant cell of Claim 84 wherein 
the encoding DNA is the same as the encoding DNA of 
the first exon of human growth hormone. 

. The homologously recombinant cell of Claim 61 capable 
of expressing a fusion protein comprising amino acids 
encoded by the exogenous exon and amino acids encoded 
by the endogenous gene. 
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A fusion protein of Claim 86 wherein the endogenous 
gene is erythropoietin. 



A fusion protein of Claim 87 comprising amino acids 
1-3 of human growth hormone and amino acids 6-165 of 
5 human erythropoietin. 



89 



3 90. 



91. 



92. 



The homologously recombinant cell of Claim 66 wherein 
the regulatory sequence is a promoter, an enhancer, a 
scaffold-attachment region or a transcription factor 
binding site. 

The homologously recombinant cell of Claim 89 wherein 
the exogenous regulatory sequence is a promoter. 

The homologously recombinant cell of Claim 89 wherein 
the exogenous regulatory sequence is a regulatory 
sequence of the mouse metallothionein-I gene, a 
regulatory sequence of an SV-40 gene, a regulatory 
sequence of a cytomegalovirus gene, a regulatory 
sequence of a collagen gene, a regulatory sequence of 
an act in gene, a regulatory sequence of an immuno- 
globulin gene, a regulatory sequence of the HMG-CoA 
reductase gene or a regulatory sequence of the EF-i<* 
gene. 

A method of altering the expression of a gene in a 
cell, comprising the steps of: 

(a) transfecting a cell with a DNA construct, the 
DNA construct comprising: 

(i) a targeting sequence; 

(ii) a regulatory sequence; 

(iii) an exon; and 

(iv) an unpaired splice -donor site, thereby 
producing a transfected cell; 
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(b) maintaining the transfected cell under condi- 
tions appropriate for homologous recombination, 
thereby producing a homologously recombinant 
cell; and 

(c) maintaining the homologously recombinant cell 
under conditions appropriate for expression of 
the gene. 

93 . The method of Claim 92 wherein the exon comprises the 
nucleotide sequence ATG. 

94. The method of Claim 92 wherein the exon further 
comprises a CAP site. 

95. The method of Claim 94 wherein the exon further 
comprises encoding DNA which is in-frame with the 
targeted gene. 

96. The method of Claim 95 wherein the encoding DNA is 
the same as the encoding DNA of the first exon of the 
targeted gene. 

97. The method of Claim 96 wherein the targeted gene is 
the erythropoietin gene. 

98. The method of Claim 96 wherein the encoding DNA is 
different from the encoding DNA of the first exon of 
the targeted gene. 

99. The method of Claim 98 wherein the targeted gene is 
the erythropoietin gene. 

100. The method of Claim 98 wherein the targeting sequence 
is homologous to a sequence within the targeted gene. 
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101. The method of Claim 98 wherein the targeting sequence 
is homologous to a sequence upstream of the coding 
region of the targeted gene. 

102. The method, 0 f claim 98 wherein the targeting sequence 
18 homol °SFOus to a sequence upstream of the endoge- 
nous regulatory sequence for the targeted gene. 

103. The method of Claim 98 wherein the construct further 
comprises a second targeting sequence homologous to a 
sequence within the targeted gene. 

10 104. The method of Claim 98 wherein the construct further 
comprises a second targeting sequence homologous to a 
sequence upstream of the coding region of the target- 
ed gene. s 



15 



105. The method of Claim 98 wherein the construct further 
comprises a second targeting sequence homologous to a 
sequence upstream of the endogenous regulatory se- 
quence for the targeted gene. 

106. The method of Claim 92 further comprising the step 



of: 

20 



(O maintaining a horaologously recombinant cell 

under conditions appropriate for production of a 
protein. 

107. The method of Claim 106 in which the gene whose ex- 
pression is altered is the erythropoietin gene. 

25 108. Erythropoietin produced by the method of claim 107. 
109. A fusion protein produced by the method of Claim 106. 
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110. A fusion protein of Claim 109 wherein the endogenous 
gene is erythropoietin. 

111. A fusion protein of Claim 110 comprising amino acids 
1-3 of human growth hormone and amino acids 6-165 of 
human erythropoietin. 

112. A method of making a protein by altering the expres- 
sion of a gene in a cell, comprising the steps of: 

(a) transfecting a cell with a DNA construct, the 
DNA construct comprising: 

(i) a targeting sequence; 

(ii) a regulatory sequence; 

(iii) an exon; and 

(iv) an unpaired splice-donor site, thereby 
producing a transfected cell; 

(b) maintaining the transfected cell under condi- 
tions appropriate for homologous recombination, 
thereby producing a homologously recombinant 
cell; and 

(c) maintaining the homologously recombinant cell 
under conditions appropriate for production of 
the protein. 

113. The method of Claim 112 wherein the exon comprises a 
CAP site. 

114. The method of Claim 113 wherein the exon comprises 
the nucleotide sequence ATG. 

115. The method of Claim 114 wherein the exori further 
comprises encoding DNA which is in- frame with the 
targeted endogenous gene. 
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116. The method of Claim lis wherein the encoding DNA is 
the same as the encoding DNA of the first exon of the 
targeted gene. 

117. The method of Claim ll 6 wherein the encoding DNA is 
different from the encoding DNA of the first exon of 
the targeted gene. 

118. The method of Claim 117 wherein the targeting se- 
quence is homologous to a sequence within the target- 
ed gene. 

10 119. The method of claim 117 wherein the targeting se- 
quence is homologous to a sequence upstream of the 
coding region of the targeted gene. 

The method of Claim 117 wherein the targeting se- 
quence is homologous to a sequence upstream of the 
endogenous regulatory sequence for the targeted gene. 

The method of Claim 117 wherein the construct further 
comprises a second targeting sequence homologous to a 
sequence within the targeted gene. 

122. The method of Claim 117 wherein the construct further 
comprises a second targeting sequence homologous to a 
sequence upstream of the coding region of the target- 
ed gene. 

123. The method of Claim 117 wherein the construct further 
comprises a second targeting sequence homologous to a 
sequence upstream of the endogenous regulatory se- 
quence for the targeted gene. 

124. An erythropoietin produced by the method of Claim 112. 



120. 

15 

121. 
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125. The erythropoietin of Claim 124 wherein the cell is 
of human origin . 

126. A protein produced by the method of Claim 112. 

127. The protein of Claim 126 which is a fusion protein. 

128. The fusion protein of Claim 127 wherein the endoge- 
nous gene is the erythropoietin gene. 

129. The fusion protein of Claim 128 comprising amino 
acids 1-3 of human growth hormone and amino acids 6- 
165 of human erythropoietin. 

130. The DNA plasmid pREP018. 

131. A DNA construct capable of altering the expression of 
a targeted gene when inserted into the chromosomal 
DNA of a cell, comprising: 

(a) a targeting sequence; 

(b) a regulatory sequence; 

(c) an exon; 

(d) a splice-donor site; 

(e) an intron; and 

(f) a splice -acceptor site. 

132. The DNA construct of Claim 131 wherein the targeting 
sequence is homologous to a sequence within the 
targeted gene. 

133. The DNA construct of Claim 131 wherein the targeting 
sequence is homologous to a sequence upstream of the 
coding region of the targeted gene. 
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134. The DNA construct of claim 131 wherein the targeting 
sequence is homologous to a sequence upstream of the 
endogenous regulatory sequence of the targeted gene. 

135. The DNA construct of Claim 131 wherein the construct 
further comprises a second targeting sequence homolo- 
gous to a sequence within the targeted gene. 



136 



The DNA construct of claim 131 wherein the construct 
further comprises a second targeting sequence homolo- 
gous to a sequence upstream of the coding region of 
10 the targeted gene. 

137. The DNA construct of Claim 131 wherein the construct 
further comprises a second targeting sequence homolo- 
gous to a sequence upstream of the endogenous regula- 
tory sequence of the targeted gene. 

15 138. A homologously recombinant cell comprising a regula- 
tory sequence, an exon, a splice-donor site, an 
intron and a splice-acceptor site introduced by 
homologous recombination upstream of the coding 
region of a targeted gene. 

20 139. The homologously recombinant cell of Claim 138 where- 
in the targeted gene is the a-interferon gene. 

140. The homologously recombinant cell of Claim 138 where- 
in the targeted gene is the erythropoietin gene. 

141. A homologously recombinant cell comprising the dhfr 
gene, the neo gene, the CMV promoter, hGH exon 1 and 
an unpaired splice-donor site targeted to a position 
upstream of the endogenous erythropoietin regulatory 
region. ~ * 
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142. The homologously recombinant cell of Claim 141 pro- 
duced by the integration of DNA from pREP018. 

143. A method of making a homologously recombinant cell 
wherein the expression of a targeted gene is altered, 
comprising the steps of: 

(a) transfecting a cell with a DNA construct, the 
construct comprising: 

(i) a targeting sequence; 

(ii) a regulatory sequence; 

(iii) an exon; 

(iv) a splice-donor site; 

(v) an intron; and 

(vi) a splice -acceptor site; 

wherein the targeting sequence directs the inte- 
gration of elements (b)-(f) upstream such that 
they are opera tively linked to the first exon of 
a targeted gene; and 
(b) maintaining the transfected cell under condi- 
tions appropriate for homologous recombination. 

144. A homologously recombinant cell produced by the 
method of Claim 143. 

145. A method of altering the expression of a gene in a 
cell, comprising the steps of: 

(a) transfecting a cell with a DNA construct, the 
construct comprising: 

(i) a targeting sequence; 

(ii) a regulatory sequence; 

(iii) an exon; 

(iv) a splice-donor site; 

(v) an intron; and 

(vi) a splice-acceptor site; 
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(b) 



(C) 



wherein the targeting sequence directs the inte- 
gration of elements (b) - (f ) upstream such that 
they are operatively linked to the first exon of 
a targeted gene; 

maintaining the transfected cell under condi- 
tions appropriate for homologous recombination; 



and 



30 



maintaining the homologously recombinant cell 
under conditions appropriate for expression of 
10 the gene. 

146. A method of making a protein by altering the expres- 
. sion of a gene in a cell, comprising the steps of- 
(a) transfecting a cell with a DNA construct, the 
construct comprising: 
15 (i) a targeting sequence; 

(ii) a regulatory sequence; 

(iii) an exon; 

(iv) a splice-donor site; 

(v) an intron; and 

(vi) a splice -acceptor site; 

wherein the targeting sequence directs the inte- 
gration of elements (b) - (f ) upstream such that 
they are operatively linked to the first exon of 
a targeted gene; 

maintaining the transfected cell under condi- 
tions appropriate for homologous recombination- 
and 

maintaining the homologously recombinant cell 
under conditions appropriate for expression of 
the protein. 



20 



25 . (b) 



(c) 



147. The method of claim 146 wherein the targeted gene is 
the a-interferon gene or the erythropoietin gene. 
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148. DNA sequences located between about 5 kilobases and 
3 0 kilobases upstream of the ATG of the erythropoie- 
tin gene. 

149. A method for targeting the erythropoietin gene in a 
mammalian cell comprising transfecting the cell with 
a construct comprising a DNA sequence homologous to a 
sequence upstream of the sequence ATG of the erythro- 
poietin gene. 

150. The method of Claim 149 wherein the construct com- 
prising a DNA sequence homologous to a sequence 

- located between about 5 kilobases and 30 kilobases 
upstream of the sequence ATG of the erythropoietin 
gene. 

151. The method of Claim 150 wherein the mammalian cell is 
a human cell. 

152. A method for targeting the erythropoietin gene in a 
mammalian cell comprising transfecting the cell with 
a construct comprising a DNA sequence homologous to a 
sequence within the erythropoietin gene. 

153. The method of Claim 152 wherein the mammalian cell is 
a human cell. 
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